Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: hackathon/kg model hack Jack Saleh Sandeep (Team Galway) #137

Open
wants to merge 2 commits into
base: main
Choose a base branch
from

Conversation

SandeepRed
Copy link

For authors

Description

  1. Get species descriptions(get_descriptions_fromPdf.py) and save JSON file**(descriptions_output.json)** to and identify GO terms related to the article(extract_relevant_GOTerms.py).
    Ideally would embed the whole text from article and vector search for GO nodes
  2. Retrieve associated gene/protein nodes by linking them to the selected GO terms from PrimeKG and embed (embed_genes.py).
  3. Perform semantic search using FAISS on OpenAI embeddings to find the most relevant Entrez Gene IDs based on query descriptions**(embed_descriptions_search_ncbi.py)**
    Final output: species_gene_matches.csv

Sandeep Chenna added 2 commits March 7, 2025 12:15
@gurdeep330 gurdeep330 changed the title Feat/kg model hack Jack Saleh Sandeep feat: hackathon/kg model hack Jack Saleh Sandeep (Team Galway) Mar 10, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants