Adding a project for GSoC 2024 #6300
Replies: 2 comments 6 replies
-
Hi @kshitijdshah99, thanks for the interested on the project. However, after internal discussion, we decide not to list the workchain/querybuilder ML/LLM project. I put the latest version of the project idea below, if you are interested, the proposal is always welcome. Please notice the deadline of the student application is 2nd April, and before that we are willing to see the potential of student by some small contributions to the community or some online discussion on the topic. Project 2 - Training an LLM to generate workchain and query from natural language promptslevel advanced Expected Size 350h In AiiDA, WorkChain is a robust workflow capable of recovering from interruptions during execution, making it ideal for handling expensive calculations. However, mastering its usage requires familiarity with certain concepts before one can confidently create a resilient workchain. Same for the This project aims to fine-tuning a large language model (LLM) to allow users to boilerplate WorkChain or build query they are interested in by expressing their desired workflow or data in a few sentences. Generating a diverse dataset of query/workchain prompts and corresponding Python code, and using this data to improve a dedicated LLM will hopefully make the produced AiiDA queries/workchains more accurate, creating a powerful tool for users to build workchain and extract the results they are interested in. Expected outcomesAt the end of the project, we aim to have a lightweight tool that can:
This will require:
Skills
NoteThis project poses an exciting challenge for both students and mentors. |
Beta Was this translation helpful? Give feedback.
-
I had recently viewed that AiiDA had came up with a very cool project regarding training a Transformer from Scratch can we open up that project for GSoC 2024 AiiDA once again
Beta Was this translation helpful? Give feedback.
All reactions