A comprehensive data warehouse solution for Ethiopian medical business data scraped from Telegram channels, including data scraping, object detection with YOLO, and ETL/ELT processes.
The repository is organized into the following directories:
- `.github/workflows/`: Configurations for GitHub Actions, enabling continuous integration and automated testing.
- `.vscode/`: Configuration files for the Visual Studio Code editor, optimizing the development environment.
- `app/`: Implementation of the machine learning model API, exposing the model through RESTful endpoints.
- `notebooks/`: Jupyter notebooks used for tasks such as data exploration, feature engineering, and preliminary modeling.
- `scripts/`: Python scripts for data scraping, preprocessing, and storage.
- `tests/`: Unit tests to ensure the correctness and robustness of the models and data processing logic.
To run the project locally, follow these steps:

1. Clone the repository:

   ```bash
   git clone https://github.com/epythonlab/EthiomedDataWarehouse.git
   cd EthiomedDataWarehouse
   ```

2. Set up a virtual environment to manage the project's dependencies.

   For Linux/macOS:

   ```bash
   python3 -m venv .venv
   source .venv/bin/activate
   ```

   For Windows:

   ```bash
   python -m venv .venv
   .venv\Scripts\activate
   ```

3. Install the required Python packages:

   ```bash
   pip install -r requirements.txt
   ```
To scrape and store the data:

1. Ensure the required libraries are installed, and store your Telegram API ID and hash in the `.env` file.
2. Navigate to the `scripts/` directory and run `telegram_scraper`.
3. Run `data_cleaner.py` to auto-clean the scraped data.
4. Create a database in your PostgreSQL instance, store the credentials in the `.env` file, and start the PostgreSQL server.
5. Once the data is cleaned, run `store_data.py` to load it into the database.
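As a rough sketch of what the auto-cleaning step in `data_cleaner.py` might do (the field names `message_id` and `text` are assumptions for illustration, not the repository's actual schema):

```python
# Hypothetical sketch of the cleaning pass over scraped Telegram messages:
# normalize whitespace, drop empty/media-only rows, de-duplicate by message id.

def clean_messages(rows):
    """Return cleaned message dicts, preserving first occurrence of each id."""
    seen = set()
    cleaned = []
    for row in rows:
        # Collapse runs of whitespace and newlines into single spaces.
        text = " ".join((row.get("text") or "").split())
        if not text:
            continue  # skip empty or media-only messages
        if row.get("message_id") in seen:
            continue  # skip duplicates re-scraped from the channel
        seen.add(row.get("message_id"))
        cleaned.append({**row, "text": text})
    return cleaned

if __name__ == "__main__":
    sample = [
        {"message_id": 1, "text": "  Paracetamol   500mg \n in stock "},
        {"message_id": 1, "text": "Paracetamol 500mg in stock"},  # duplicate
        {"message_id": 2, "text": ""},  # empty, dropped
    ]
    print(clean_messages(sample))
```

The cleaned rows can then be written to PostgreSQL by `store_data.py` using the credentials from `.env`.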
To transform the data with DBT:

1. Go to the `ethio_medical_project` directory and explore the DBT configurations.
2. Run the DBT models:

   ```bash
   dbt run
   ```

3. Test and document the models:

   ```bash
   dbt test
   dbt docs generate
   dbt docs serve
   ```
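As an illustration only, a staging model in `ethio_medical_project` could look like the following (the model name, source name, and columns are assumptions, not the project's actual schema):

```sql
-- models/staging/stg_telegram_messages.sql (hypothetical model name)
-- Standardizes raw scraped messages before downstream marts.
select
    message_id,
    channel_name,
    lower(trim(message_text)) as message_text,
    message_date::timestamp as message_date
from {{ source('raw', 'telegram_messages') }}
where message_text is not null
```

`dbt run` materializes such models in PostgreSQL, and `dbt test` validates any constraints declared for them in the project's `schema.yml`.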
To set up object detection:

1. Ensure you have the necessary dependencies installed, including YOLO and its required libraries (e.g., OpenCV plus TensorFlow or PyTorch, depending on the YOLO implementation):

   ```bash
   pip install opencv-python
   pip install torch torchvision  # for PyTorch-based YOLO
   pip install tensorflow         # for TensorFlow-based YOLO
   ```

2. Download the YOLO model:

   ```bash
   git clone https://github.com/ultralytics/yolov5.git
   cd yolov5
   pip install -r requirements.txt
   ```
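A minimal sketch of running YOLOv5 on a scraped image, assuming the PyTorch-based setup above (the image path is hypothetical, not a path from this repository):

```python
# Sketch of detecting objects in scraped Telegram images with YOLOv5.
# The helper below is dependency-free; torch is imported only when run directly.

def summarize_detections(records, min_conf=0.25):
    """Keep detections above a confidence threshold as (name, confidence) pairs."""
    return [(r["name"], round(r["confidence"], 2))
            for r in records if r["confidence"] >= min_conf]

if __name__ == "__main__":
    import torch  # requires the torch/torchvision install from the step above

    # Load the small pretrained model via torch.hub (downloads weights on first run).
    model = torch.hub.load("ultralytics/yolov5", "yolov5s")
    results = model("data/images/sample.jpg")  # hypothetical image path
    records = results.pandas().xyxy[0].to_dict("records")
    print(summarize_detections(records))
```

The resulting detections can be stored alongside the message data in PostgreSQL for downstream analysis.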
- Once the YOLO model is installed, go to the `notebooks/` directory and run the notebook to check the outputs, then explore the PostgreSQL database for the stored data.
- To start the API, make sure you are in the root directory and run:

  ```bash
  uvicorn app.main:app --reload
  ```

- Note: Ensure that all the required libraries are installed. You can install any missing dependencies manually using `requirements.txt`.
We welcome contributions to improve the project. Please follow the steps below to contribute:
- Fork the repository.
- Create a new branch for your feature or bugfix.
- Submit a pull request with a detailed explanation of your changes.