-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Dmp/619 : Added gen_project service #83
Conversation
…kflow YAML/JSON" This reverts commit 95ef1fc.
…oject workflow YAML/JSON
@josephjclark can you help me in explaining what exactly the script requirement is for files to be imported to lightning ? As mentioned in the issue ticket (OpenFn/kit#619). The demo should be executable from a terminal. It could be a python or bash script. It should not be integrated with the the OpenFn CLI.
I want to know what exactly Lightning project structure on file system is because as per my understanding it's the Additionally, if you have any suggestions or improvements for this ? |
Hi @Pin4sf Your mentor should help you out with the review and moving the project forward from here on. Well be giving high-level guidance from a distance but I am not directly involved in the project. We're just preparing some documentation to help you with the lightning import stuff. The first step is for you to able to import your generated yamls into Lightning. Once you've got all that working, it's up to you to work out how to build a demo to allow a non-technical openfn contributor to generate and visualize a workflow (or maybe, if we have to, just visualize a pre-generated workflow). There are many ways to do this. A very simple CLI command or single script is acceptable. Once you understand the deploy process, that should all make a lot more sense. And if you need more help at that point then we'll set up a call. |
Instructions for setting up Lightning are coming soon - we've been very busy and struggling time to prepare material. In the meantime, there are a LOT of files in this PR. Do we need them? Can they be bundled or compressed? What strategies can we employ to have less data in both the repo and the deployed docker image? |
Okay no problem thanks for updating me regarding this .
Yeah those files are for NER model which is used for adaptor identification. I have made the complete model available with AI agent so that it could work offline also but I will look into it try to find a way to have the amount of data reduced while still getting good YAML/JSON generation. |
@Pin4sf ok, let's try and focus this week on making the ner model files more manageable. And if there really isn't an answer, what are the alternatives? This may not be easy but it's important to us to have a viable solution. So let's take the time to get it right! |
Hi @Pin4sf, how are you getting on with the model files? I've got some resources for you on the demo setup. We run a public demo version of our app at demo.openfn.org. You can log in with username We want you to create a local demo - as a python or bash script, whatever you like, but it has to be easy to run - which will take a prompt (a list of workflow steps), generate a workflow yaml to disk, and publish that yaml straight to demo.openfn.org. That makes it easy for both us AND you to actually visualise the workflows you're generating with the service. @elias-ba has prepared a tutorial video showing you details of how to use our CLI to "deploy" a project.yaml to the demo app: https://www.loom.com/share/76db8ec7e4ab4d81b73c65d0019078f4?sid=9845e902-1ee5-4e8e-9e2d-e9650bc89f98 |
Hey @josephjclark, Thank you very much for the resources and updates regarding lightning import. This is really helpful.
I have reviewed the method I am using for file generation and found that my YAML/JSON structure generation primarily relies on rule-based parsing and regular expressions to assemble the file structure. Therefore, there was not much need for a specialized NER model for Adopter identification. Instead, adopters can be directly referenced from the list I had already updated in the codebase. We can use the existing NER model present in the root of the Apollo repository in the model directory. This approach allows the AI agent to function the same as it did with a dedicated model. |
Hi @Pin4sf, how are you getting on with the demo setup? I expect that to be where you're spending your time. If you need review or feedback you should first let your mentor know. If there's no other direction you can contact me directly. I've put together a couple of sample workflow descriptions for you, based on some real work we're doing at the moment. I think these are all quite hard in different ways - I'm not expecting perfect results (there isn't enough information), but I am hoping for a good starting point.
You might want to think about whether and how we can feed adaptor specific knowledge into the service. For example it would be useful to know that Satusehat can semd FHIR resources and webhooks are triggered with the openfn adaptor. How can we encode this information? |
Hello @josephjclark, I am currently working on setting up a demo to be imported on Lightning. However, I am encountering an issue where, after deploying the project.yaml and exporting my API credentials to the demo app endpoint (following the video instructions), I am getting an authorization error. Could you help me identify why the authorization is failing? I am attaching the current midpoint update here. There are a few doubts I have:
Thank you for giving me these test cases to work on. I will keep you updated on my progress. |
Ah it turns out the 403 is my fault - try and sign in with I'm afraid I don't understand your linter (why do you need to lint a file you've just generated?) and I won't have time to look into it this week. If your web service returns the I cannot give you a dataset of input steps. It is too broad. Better to focus on the principle. How would you like to encode knowledge about adaptors? What options do we have? We do have a separate RAG service in development which allows documentation to be pulled from the doc site and added to a prompt. Would that facility be useful? Let me put it this way: the knowledge base about our adaptors is docs.openfn.org/adaptors. Can you use that knowledge base to enhance the abilities of the generator? |
Hey @josephjclark Hope you are doing well. Sorry for late reply I had some university stuff. These are the current outputs can you please verify that these are the desired output you are looking for ?
|
@Pin4sf We would like to verify the outputs by looking at them through Lightning. It is hard to debug the yaml directly. How are you getting on with the demo? A good demo will allow you to prove to us that you've completed the task, and give us a reasonable chance at assessing the quality of your work. It is a vital part of the assignment, and should have been your focus on this project for the last six weeks. If you are having difficulty (which wouldn't surprise me, it's not an easy assignment), who have you contacted for support? |
I am preparing for the demo itself. This was quite hard for me also as a college student and learning development I was facing various issues but got them resolved somehow and got to learn new things in the process. |
Hey @Pin4sf learning when and where to ask for help is one of the most important dev skills of all. Just make sure to keep practicing it. If we can't answer, we won't! But usually we can at least give you a nudge in the right direction. |
Thank you for your efforts here @Pin4sf . We'll take over the demo side of things and try and work out how to integrate this into our main app. |
Thank you very much to all the maintainers who helped me throughout my mentorship program. I've learned so much about open source development and feel much more confident in my abilities. It was a privilege to work on this issue, and I hope to contribute more to the project in the future! |
[DMP 2024] Generate a project.yaml file from a list of steps #619
An AI agent that is able to generate a workflow for a user, based on an instruction specifying the required steps and adaptors.
As specified in the issue (OpenFn/kit#619)
I have added the service
gen_project
which have a Python module to generate YAML/JSON files from workflow steps using NLP, rule-based parsing and NER model for adopter identification.Deliverables
gen_project
project.yaml
orworkflow.json
fileThis PR includes the first 3 deliverables completely. Still working a demo script which lets user easily test various inputs and see them visualised in Lightning.
Demo Script
Current demo script to use the
gen_project
service.Usage
Send a POST request to
/services/gen_project
with a JSON body containing the workflow steps and the desired output format.CLI Call
tmp/input.json
with the path to the input file andtmp/output.json
with the path to the output file, or you can create tmp directory in the root of the project and run the command as is.Demo
A demo video showcasing cli call and service usage for reference.
Untitled.video.-.Made.with.Clipchamp.mp4