A Python web application that extracts, analyzes, and renders GitHub repository content as markdown, with token counting and input-cost estimates for multiple AI models.
- Real-time repository content analysis
- Comprehensive token counting and cost estimation
- Support for multiple AI model pricing:
  - o1 ($15.00 per 1M input tokens)
  - o1-mini ($3.00 per 1M input tokens)
  - Claude 3.5 Sonnet ($3.00 per 1M input tokens)
  - GPT-4o ($2.50 per 1M input tokens)
  - Gemini 1.5 Pro ($1.25 per 1M input tokens)
  - Gemini 1.5 Flash ($0.075 per 1M input tokens)
- Clean markdown output with file structure
- Color-coded pricing display
- Easy-to-use web interface
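
As a rough illustration of how the per-model prices listed above turn into a cost estimate, here is a minimal sketch. The prices are copied from the list; the names `PRICE_PER_MILLION` and `estimate_input_costs` are hypothetical and are not taken from this project's code, and the token count is assumed to be computed elsewhere.

```python
from decimal import Decimal

# USD per 1M input tokens, copied from the feature list above.
PRICE_PER_MILLION = {
    "o1": Decimal("15.00"),
    "o1-mini": Decimal("3.00"),
    "Claude 3.5 Sonnet": Decimal("3.00"),
    "GPT-4o": Decimal("2.50"),
    "Gemini 1.5 Pro": Decimal("1.25"),
    "Gemini 1.5 Flash": Decimal("0.075"),
}


def estimate_input_costs(token_count: int) -> dict:
    """Return the estimated input cost in USD for each model.

    Hypothetical helper: cost = price per 1M tokens * tokens / 1,000,000.
    Decimal is used so small prices do not pick up float rounding noise.
    """
    if token_count < 0:
        raise ValueError("token_count must be non-negative")
    return {
        model: price * token_count / 1_000_000
        for model, price in PRICE_PER_MILLION.items()
    }


if __name__ == "__main__":
    # Print the estimates for a 200,000-token repository, cheapest first.
    costs = estimate_input_costs(200_000)
    for model, cost in sorted(costs.items(), key=lambda item: item[1]):
        print(f"{model}: ${cost:.4f}")
```

Sorting by cost, as in the usage lines, is one simple way a display could order or color-code models from cheapest to most expensive.
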
- Clone the repository:
  `git clone https://github.com/U-C4N/pdftomarkdown.git`
  `cd pdftomarkdown`
- Install dependencies:
  `pip install flask gitpython`
- Run the application:
  `python app.py`
- Open your web browser and navigate to http://localhost:5000
- Enter a GitHub repository URL
- Click "Clone Repository" to analyze
- View the generated markdown and token costs
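
The usage steps above (clone, then view the generated markdown) could work roughly as sketched below. This is an assumption about the general approach, not the code in `app.py`: `clone_repository`, `render_markdown`, and `SKIP_DIRS` are hypothetical names, and the output layout is only one possible way to show a file structure followed by file contents. The only GitPython call used is `Repo.clone_from`.

```python
import os
import tempfile
from pathlib import Path

# Directories left out of the generated markdown (an assumption).
SKIP_DIRS = {".git"}


def clone_repository(url: str) -> Path:
    """Shallow-clone a repository into a fresh temporary directory."""
    # Imported lazily so render_markdown works without GitPython installed.
    from git import Repo

    target = Path(tempfile.mkdtemp(prefix="repo-"))
    Repo.clone_from(url, target, depth=1)  # depth=1 skips the history
    return target


def render_markdown(root: Path) -> str:
    """Render a directory as markdown: a file list, then each file's text."""
    files = []
    for dirpath, dirnames, filenames in os.walk(root):
        # Prune skipped directories in place so os.walk does not descend.
        dirnames[:] = sorted(d for d in dirnames if d not in SKIP_DIRS)
        files.extend(Path(dirpath, name) for name in sorted(filenames))

    lines = ["File structure:", ""]
    lines += [f"- {path.relative_to(root).as_posix()}" for path in files]

    for path in files:
        try:
            text = path.read_text(encoding="utf-8")
        except UnicodeDecodeError:
            continue  # binary file: listed above, contents omitted
        name = path.relative_to(root).as_posix()
        # Tilde fences avoid clashing with backticks inside file contents.
        lines += ["", f"{name}:", "", "~~~", text.rstrip("\n"), "~~~"]

    return "\n".join(lines) + "\n"
```

With GitPython installed, `render_markdown(clone_repository(url))` would produce the markdown for a repository URL; the resulting text is what a token counter would then measure.
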
- Python 3.x
- Flask
- GitPython
- JavaScript
- HTML/CSS
This project is licensed under the MIT License - see the LICENSE file for details.