Climate Knowledge Graph

A publishing knowledge graph for UN IPCC reports

Climate Knowledge Graph is a service for structuring large climate change corpora so that they are machine readable and can be re-published and searched with AI LLMs, all using open science methods.

#ClimateKG 🌏🌍🌎

Climate Knowledge Graph is an R&D project hosted at TIB – Leibniz Information Centre for Science and Technology and organised in partnership with #semanticClimate.

Git repository: https://github.com/TIBHannover/climate-knowledge-graph

Project lead: Simon Worthington, Open Science Lab, TIB, Hanover – e-mail: [email protected] | Mastodon: https://openbiblio.social/@mrchristian

Project status: 12-month development phase, starting in May 2025, supported by TIB.

The mission of Climate Knowledge Graph is to support the dissemination of the IPCC reports.

The IPCC is the Intergovernmental Panel on Climate Change and is a body of the United Nations. The IPCC Reports are one of the definitive climate change science and policy knowledge sources – which map out pathways into a future where the harmful effects of climate change are addressed.

Currently, IPCC Reports are published as PDFs, with some reports also available as webpages.

The Climate Knowledge Graph (ClimateKG) project will create a knowledge graph of the reports, initially with the open access parts of the IPCC Sixth Assessment Report (AR6), to provide two open web resources for others to use:

  • firstly, a modern open web index for searching corpora and for AI LLM use; and
  • secondly, a publishing engine that can package search results.

A knowledge graph is a database that precisely describes entities and the relationships between them, enabling search and logical reasoning. Consider, for example, the following question:

‘How to design a city climate action plan to mitigate against extreme weather such as floods and fires — what are the mitigation policy options — and where are the authoring scientists geographically based?’.

Using a knowledge graph, a search could return links to all the related IPCC Report chapters, together with a list of authors, their global locations, and the related research paper citations used in the report, all packaged as a full-text publication. The derivative publications would be automatically typeset, available in multiple formats, and semantically marked up. See the prototype: IPCC Reports and City Climate Change Plans: Proof of concept prototype - Open Climate Reader.
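The kind of query described above can be sketched with a tiny in-memory set of subject-predicate-object triples. This is only an illustration of how a graph lets one search hop from topics to chapters, authors, locations, and citations. The predicate names (`covers`, `hasAuthor`, `basedIn`, `cites`) and all identifiers are hypothetical placeholders, not the ClimateKG data model or real IPCC data.

```python
# Hypothetical triples (subject, predicate, object). Every identifier here is a
# placeholder for illustration; none of it is real IPCC or ClimateKG data.
TRIPLES = [
    ("chapter:A", "covers", "topic:urban-flooding"),
    ("chapter:A", "hasAuthor", "author:1"),
    ("chapter:A", "cites", "paper:X"),
    ("chapter:B", "covers", "topic:wildfire"),
    ("chapter:B", "hasAuthor", "author:2"),
    ("chapter:B", "cites", "paper:Y"),
    ("chapter:C", "covers", "topic:sea-ice"),
    ("chapter:C", "hasAuthor", "author:1"),
    ("author:1", "basedIn", "place:P"),
    ("author:2", "basedIn", "place:Q"),
]


def objects(triples, subject, predicate):
    """Return all objects linked from `subject` via `predicate`, sorted."""
    return sorted(o for s, p, o in triples if s == subject and p == predicate)


def subjects(triples, predicate, obj):
    """Return all subjects linked to `obj` via `predicate`, sorted."""
    return sorted(s for s, p, o in triples if p == predicate and o == obj)


def search(triples, topics):
    """For each chapter covering any of `topics`, collect authors,
    author locations, and cited papers by following graph relationships."""
    chapters = sorted({c for t in topics for c in subjects(triples, "covers", t)})
    results = []
    for chapter in chapters:
        authors = objects(triples, chapter, "hasAuthor")
        results.append({
            "chapter": chapter,
            "authors": {a: objects(triples, a, "basedIn") for a in authors},
            "citations": objects(triples, chapter, "cites"),
        })
    return results


if __name__ == "__main__":
    for hit in search(TRIPLES, ["topic:urban-flooding", "topic:wildfire"]):
        print(hit)
```

A production knowledge graph would typically answer the same question with a graph query language such as SPARQL rather than list scans; the sketch only shows the traversal idea.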

ClimateKG specialises in semantic and linked open data enrichment of large-scale fixed scientific corpora, using RDF/Semantic Web design models and Wikibase/data technology to create model open-science-based indexing and cataloguing.
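To make the RDF idea concrete, here is a minimal sketch that writes statements as N-Triples, one of the standard RDF serialisations. The base namespace `https://example.org/climatekg/` and the local names used in the example are assumptions made for illustration; they are not identifiers used by the project or its Wikibase instance.

```python
# Hypothetical base namespace, used only for this illustration.
BASE = "https://example.org/climatekg/"


def iri(local_name):
    """Wrap a local name in the hypothetical namespace as an N-Triples IRI."""
    return f"<{BASE}{local_name}>"


def literal(text):
    """Encode a plain string literal, escaping the characters N-Triples requires."""
    escaped = (
        text.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )
    return f'"{escaped}"'


def to_ntriples(statements):
    """Serialise (subject, predicate, object, is_literal) tuples as N-Triples lines."""
    lines = []
    for subject, predicate, obj, is_literal in statements:
        obj_term = literal(obj) if is_literal else iri(obj)
        lines.append(f"{iri(subject)} {iri(predicate)} {obj_term} .")
    return "\n".join(lines)


if __name__ == "__main__":
    print(to_ntriples([
        ("term/1", "label", "example term", True),      # literal-valued statement
        ("term/1", "definedIn", "glossary", False),     # IRI-valued statement
    ]))
```

Each printed line is one self-describing statement, which is what allows separately produced datasets to be merged and linked.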

Roadmap

| No. | Task area | Status | Link | Q1–Q4 |
|---|---|---|---|---|
| 1 | AR6 report as semantified HTML w/IDs | Alpha | Git | X |
| 2 | Table of contents of AR6 70 chapters | Alpha | Git | X |
| 3 | IPCC Glossary (800 terms) to Wikibase (WB) | Done | WB | X |
| 4 | Data relationship model | Pre-alpha | WB | X X |
| 5 | Infrastructure | Done | NA | |
| 6 | Publishing pipeline | Production | Git | X |
| 7 | ClimateKG - Index Service (dev) | Prototype (PoC) | WB | X X |
| 8 | ClimateKG - Publishing Service (dev) | Prototype (PoC) | Git | X X |
| 9 | Software: Dictionaries, machine learning, etc | Production | JN | X |
| 10 | PDF/Web to HTML Corpus Transformer | Prod./Custom | Git | X X |
| 11 | AI LLM RAG Corpus use | Evaluation | TBC | X X X |
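Task 1 in the roadmap refers to semantified HTML with IDs. As a rough sketch of why element IDs are useful, the following code builds a lookup from paragraph `id` attributes to paragraph text using only the Python standard library. The ID values in the example (`s1_p1`, `s1_p2`) and the choice to index only `<p>` elements are assumptions for illustration, not the project's actual ID scheme or tooling.

```python
from html.parser import HTMLParser


class ParagraphIndexer(HTMLParser):
    """Collect the text of every <p> element that carries an id attribute."""

    def __init__(self):
        super().__init__()
        self.index = {}          # maps paragraph id -> accumulated raw text
        self._current_id = None  # id of the <p> currently being read, if any

    def handle_starttag(self, tag, attrs):
        if tag == "p":
            self._current_id = dict(attrs).get("id")
            if self._current_id is not None:
                self.index[self._current_id] = ""

    def handle_endtag(self, tag):
        if tag == "p":
            self._current_id = None

    def handle_data(self, data):
        # Text inside nested inline tags (e.g. <em>) is still attributed
        # to the enclosing paragraph.
        if self._current_id is not None:
            self.index[self._current_id] += data


def index_paragraphs(html):
    """Return {paragraph_id: whitespace-normalised text} for an HTML string."""
    parser = ParagraphIndexer()
    parser.feed(html)
    parser.close()
    return {pid: " ".join(text.split()) for pid, text in parser.index.items()}


if __name__ == "__main__":
    sample = '<div><p id="s1_p1">Hello <em>world</em>.</p><p id="s1_p2">Second</p></div>'
    print(index_paragraphs(sample))
```

With stable IDs like these, a search result or citation can point at an exact paragraph rather than at a whole chapter or PDF page.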

Background

ClimateKG comes directly out of the five-year-old #semanticClimate (#sC) open research group, which works on software tool development for semantic enrichment. The group was founded by Dr. Gitanjali Yadav of the National Institute of Plant Genome Research (NIPGR), Delhi, Dr Peter Murray-Rust of Cambridge University, and Simon Worthington (TIB). #semanticClimate is active on a daily basis as a community, and NIPGR supports an India-wide internship programme, hackathon series, and youth outreach programme. Additionally, #sC presents globally, from Beijing and Montevideo to Berlin.

TIB is one of the largest science libraries in the world and is a global hub for knowledge graph R&D, service development, and infrastructure provision, especially through the Open Research Knowledge Graph (ORKG). ClimateKG partners with, and is supported in knowledge graph expertise by, the Lab Knowledge Infrastructures led by Dr Markus Stocker. At TIB, ClimateKG is based in the Open Science Lab and makes use of expertise from the NFDI4Culture (cultural heritage consortium of the larger German National Research Data Infrastructure Consortium) projects: Wikibase4Research, Computational Publishing Service, and Antelope (terminology service).

Thank you to TIB colleagues and to #semanticClimate members, volunteers, interns, and hackathon participants for their support and contributions.
