Skip to content

Latest commit

 

History

History
71 lines (62 loc) · 5.14 KB

TechReading.md

File metadata and controls

71 lines (62 loc) · 5.14 KB

Tech papers on Current Reading List

Software Management Plan / SE

  1. ELIXIR-STEERS M3.2 Report: First version of Software Management Plan interface implemented in Data Stewardship Wizard
  2. machine-actionable Software Management Plan Ontology (maSMP Ontology)
  3. Usage guidance (aka profiles) for the machine-actionable Software Management Plan Ontology
  4. https://doi.org/10.37044/osf.io/k8znb
  5. D4.4 - Guidelines for recommended metadata standard for research software within EOSC
  6. Intelligent analysis for software data: research and applications
  7. What Do We (Not) Know About Research Software Engineering?
  8. The Research Software Encyclopedia: A Community Framework to Define Research Software
  9. Rethinking Software Engineering in the Foundation Model Era: A Curated Catalogue of Challenges in the Development of Trustworthy FMware
  10. Making Biomedical Research Software FAIR: Actionable Step-by-step Guidelines with a User-support Tool
  11. Ten quick tips for building FAIR workflows
  12. Understanding Fairness in Software Engineering: Insights from Stack Exchange Sites
  13. Scicodes
  14. Nine Best Practices for Research Software Registries and Repositories:A Concise Guide
  15. Citation File Format (CFF) vs BibTeX Conversion
  16. Software Heritage
  17. https://edsbook.org/notebooks/about - also mentions RO-Crate, https://drive.google.com/file/d/1INJBUfC_YZf9qVtaZ_lMpSayaE2SXF5s/view
  18. https://www.researchobject.org/ro-crate/
  19. https://www.rohub.org/
  20. RO-Crate
  21. Dias: Dynamic Rewriting of Pandas Code
  22. In Database Data Imputation : https://doi.org/10.1145/3639326
  23. DoppelGanger++: Towards Fast Dependency Graph Generation for Database Replay https://doi.org/10.1145/3639322
  24. Machine Unlearning in Learned Databases: An Experimental Analysis https://doi.org/10.1145/3639304
  25. Determining the Largest Overlap between Tables : https://doi.org/10.1145/3639303
  26. Modeling Shifting Workloads for Learned Database Systems : https://doi.org/10.1145/3639293
  27. Controllable Tabular Data Synthesis Using Diffusion Models : https://doi.org/10.1145/3639283
  28. Spruce: a Fast yet Space-saving Structure for Dynamic Graph Storage : https://doi.org/10.1145/3639282
  29. Optimizing Dataflow Systems for Scalable Interactive Visualization : https://doi.org/10.1145/3639276
  30. LIT: Lightning-fast In-memory Temporal Indexing : https://doi.org/10.1145/3639275
  31. Optimizing Nested Recursive Queries : https://doi.org/10.1145/3639271
  32. https://github.com/earthlab/earthpy/blob/main/.zenodo.json

Data Versioning

  1. DVC: Data Version Control - Git for Data & Models

Example Project : https://github.com/binzzheng/DVC-PyTorch

Software Citations

  1. Journal Production Guidance for Software and Data Citations
  2. Software Citation Principles
  3. citation-file-format
  4. cffinit
  5. Citation File Format - Status and current challenges
  6. How to cite and describe software
  7. Software Citation ; datacite.org
  8. Citation File Format 2021

Tech Bloggers (I like)

  1. https://third-bit.com/ideas/research/
  2. https://jzhao.xyz/
  3. https://vickiboykis.com/

General SE

  1. https://arxiv.org/abs/2310.10817
  2. Knowledge Graph
  3. https://wholetale.org/, https://github.com/whole-tale