Improvements for maintainability #24

amrit110 · 2025-01-29T14:54:44Z

vector-inference is starting to expand to more models and has onboarded users. I'm looking to contribute by helping improve its maintainability. Concretely, I have 4 tasks in mind:

Reduce maintenance overhead by having dependabot submit security updates and dependency updates as long as they conform to the requirements of the library
Add a docs build to create a webpage to help users
Improve the dev build tooling (migrate to uv from poetry)
Add unit tests for the python code in the vec-inf package.
Automate docker image build process and pushing to dockerhub/GCR repo

amrit110 · 2025-01-29T14:55:46Z

#23 for dependabot configuration

amrit110 · 2025-01-29T21:27:22Z

#28 for migrating to uv

amrit110 · 2025-02-03T17:18:34Z

#30 for docs build

jwilles · 2025-02-03T23:13:53Z

One issue that I will add for discussion - VI is slow to onboard new models because of the strong coupling between code and configuration. We currently bundle the job config with the VI package. This means that users need update their VI package to add a few lines to their models.csv which doesn't feel great.

jwilles · 2025-02-18T19:47:18Z

@XkunW We are also missing developer documentation. The user facing documentation is relatively clear but it would be good to expand the documentation with a contribution guide.

amrit110 · 2025-02-18T22:54:43Z

@jwilles, I have submitted a PR #42 that addresses the issue you raised. I have added you to review as well. I haven't actually tested a custom model yet on the cluster. I will do that shortly to make sure it works.

amrit110 · 2025-02-19T15:08:18Z

Update @jwilles, @XkunW I tested a custom model - https://huggingface.co/Qwen/Qwen2.5-7B-Instruct-1M. Works! I will be updating the documentation shortly on how users can onboard their own models.

rohan-uiuc · 2025-03-01T21:57:40Z

I like this package and quite interested to make it work for my use case, just want to confirm if there are any plans to support the following use cases?

Programmatic API access instead of json mode in cli
Cluster specific information fetched from a config file, for eg. .sif file location, module loads for cuda/cudnn, partition used for models, gpus per model as this can differ based on partition/gpu type
Support for multiple clusters, maybe it can be made possible using the same config I mentioned in point 2

Thanks for your work, it's quite useful :)

amrit110 · 2025-03-02T02:45:30Z

@rohan-uiuc thanks for your interest. Can you elaborate on 1. and give an example usage? If I understand it better, perhaps it can be a feature request for us. Regarding 2. and 3. I think my colleague @jwilles might have a better idea.

rohan-uiuc · 2025-03-02T19:16:32Z

@amrit110 thanks for the quick response. About 1, it's about the ability to enable using this package natively via python scripts/applications. I was playing around yesterday, let me open a PR for your reference. I didn't have a great way to test as it will need several manual changes to make it work on my cluster, which I might attempt in the coming week.

will be most useful for any devs like me who have access to different clusters and can get the package up and running with minimal changes to a single config file. Would love some help there.

rohan-uiuc · 2025-03-02T19:21:20Z

I've opened #54 . Please feel free to leave comments or make improvements as I cannot manually test it yet. I will be happy to contribute with some more features like adding an apptainer .def and automatically building it if .sif is not found. But I would need point 2. as a prerequisite to able to test my contributions.

XkunW · 2025-03-03T17:46:59Z

I've opened #54 . Please feel free to leave comments or make improvements as I cannot manually test it yet. I will be happy to contribute with some more features like adding an apptainer .def and automatically building it if .sif is not found. But I would need point 2. as a prerequisite to able to test my contributions.

@rohan-uiuc Thanks for your contribution! Regarding point 2 and 3, we have plans on the roadmap to decouple the config with our current cluster setup for better generalizability, so stay tuned on that. We're also in the process of automating the deployment of the singularity container (.sif file). The singularity container is only one way of using vector inference, python venvs are also accepted.

amrit110 self-assigned this Jan 29, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improvements for maintainability #24

Improvements for maintainability #24

amrit110 commented Jan 29, 2025 •

edited

Loading

amrit110 commented Jan 29, 2025

amrit110 commented Jan 29, 2025

amrit110 commented Feb 3, 2025

jwilles commented Feb 3, 2025

jwilles commented Feb 18, 2025

amrit110 commented Feb 18, 2025

amrit110 commented Feb 19, 2025

rohan-uiuc commented Mar 1, 2025

amrit110 commented Mar 2, 2025

rohan-uiuc commented Mar 2, 2025

rohan-uiuc commented Mar 2, 2025

XkunW commented Mar 3, 2025

Improvements for maintainability #24

Improvements for maintainability #24

Comments

amrit110 commented Jan 29, 2025 • edited Loading

amrit110 commented Jan 29, 2025

amrit110 commented Jan 29, 2025

amrit110 commented Feb 3, 2025

jwilles commented Feb 3, 2025

jwilles commented Feb 18, 2025

amrit110 commented Feb 18, 2025

amrit110 commented Feb 19, 2025

rohan-uiuc commented Mar 1, 2025

amrit110 commented Mar 2, 2025

rohan-uiuc commented Mar 2, 2025

rohan-uiuc commented Mar 2, 2025

XkunW commented Mar 3, 2025

amrit110 commented Jan 29, 2025 •

edited

Loading