Skip to content

Latest commit

 

History

History
59 lines (44 loc) · 1.9 KB

README.md

File metadata and controls

59 lines (44 loc) · 1.9 KB

PDF to Markdown Token Counter

A sophisticated Python web application for extracting, analyzing, and rendering GitHub repository content with advanced multi-model token counting capabilities.

Features

  • Real-time repository content analysis
  • Comprehensive token counting and cost estimation
  • Support for multiple AI model pricing:
    • o1 ($15.00 per 1M input tokens)
    • o1-mini ($3.00 per 1M input tokens)
    • Claude 3.5 Sonnet ($3.00 per 1M input tokens)
    • GPT-4o ($2.50 per 1M input tokens)
    • Gemini 1.5 Pro ($1.25 per 1M input tokens)
    • Gemini 1.5 Flash ($0.075 per 1M input tokens)
  • Clean markdown output with file structure
  • Color-coded pricing display
  • Easy-to-use web interface

Installation

  1. Clone the repository:
git clone https://github.com/U-C4N/pdftomarkdown.git
cd pdftomarkdown
  1. Install dependencies:
pip install flask gitpython
  1. Run the application:
python app.py

Usage

  1. Open your web browser and navigate to http://localhost:5000
  2. Enter a GitHub repository URL
  3. Click "Clone Repository" to analyze
  4. View the generated markdown and token costs

Technology Stack

  • Python 3.x
  • Flask
  • GitPython
  • JavaScript
  • HTML/CSS

Author

Umutcan Edizaslan: TutTrue

License

This project is licensed under the MIT License - see the LICENSE file for details.