qtu-UBC/LCI-Data-AIPrep: Use AI to prepare life cycle inventory data

Creating an automated, high-throughput method to extract relevant information from documents, such as peer-reviewed life cycle assessment (LCA) articles and technical reports, is crucial for advancing life cycle inventory (LCI) modeling. Large Language Models (LLMs) can efficiently curate large datasets from various sources, including text descriptions, tabulated data, knowledge graphs, and images. This project aims to create an end-to-end, LLM-based LCI data curation framework. Key steps of this framework include:

Detect and partition the key elements (e.g., tables, images) of a given PDF
Embed the elements and persist them in a vector database
Apply hybrid search to retrieve the information relevant to (1) system boundary completion, (2) inventory data (flow name and quantity) synthesis, and (3) assumption validation
Output the curated LCI data
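
The retrieval step above can be illustrated with a minimal sketch. It is not the repository's implementation: the `Element` class, the bag-of-words `embed` stand-in, the in-memory `store` list, and the `alpha` weighting are all assumptions made for illustration. A real pipeline would use a PDF partitioner, a learned embedding model, and an actual vector database. The sketch only shows the idea of hybrid search, which blends a vector-similarity score with a keyword-match score.

```python
import math
import re
from collections import Counter
from dataclasses import dataclass


@dataclass
class Element:
    """One partitioned piece of a document (hypothetical structure)."""
    kind: str  # e.g. "text" or "table"
    text: str


def tokenize(text: str) -> list[str]:
    # Lowercase alphanumeric tokens; punctuation is dropped.
    return re.findall(r"[a-z0-9]+", text.lower())


def embed(text: str) -> Counter:
    # Stand-in for a real embedding model: a bag-of-words count vector.
    return Counter(tokenize(text))


def cosine(a: Counter, b: Counter) -> float:
    # Cosine similarity between two sparse count vectors.
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def keyword_score(query: str, text: str) -> float:
    # Fraction of distinct query terms that appear in the text.
    terms = set(tokenize(query))
    return len(terms & set(tokenize(text))) / len(terms) if terms else 0.0


def hybrid_search(query, store, alpha=0.5, top_k=2):
    # Blend vector similarity and keyword overlap; alpha weights the vector part.
    q_vec = embed(query)
    scored = [
        (alpha * cosine(q_vec, vec) + (1 - alpha) * keyword_score(query, el.text), el)
        for el, vec in store
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [el for _, el in scored[:top_k]]


# Toy "partitioned" elements; the contents are made up for illustration.
elements = [
    Element("text", "The system boundary is cradle to gate and excludes transport."),
    Element("table", "Flow name: electricity; quantity: 12.5 kWh per kg product"),
    Element("text", "We assume a 20 year plant lifetime."),
]

# "Persist" each element with its vector (a plain list stands in for a vector database).
store = [(el, embed(el.text)) for el in elements]

hits = hybrid_search("electricity flow quantity", store, top_k=1)
print(hits[0].kind, "|", hits[0].text)
```

Here the query about electricity flow quantities ranks the table element first, because it matches on both the keyword and the vector score. Tuning `alpha` toward 1 would favor semantic similarity, and toward 0 exact term matches.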

Repository contents (43 commits):

input
output
tools
.gitignore
LICENSE
README.md
post-processing.py
requirements.txt
test_agentic_RAG.py
test_chunk_by_title_pdf.py
test_lci_info_extract.py
test_multimodal_rag.py
