Skip to content

Dsadd4/MS-mol

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 

Repository files navigation

Awesome-SpectraAI-Resources

Awesome SpectraAI Resources

✨✨ A curated collection of resources on artificial intelligence for spectral data analysis, covering computational methods for mass spectrometry (MS), NMR, IR, and XRD data.


1. Mass Spectrometry (Small Molecules)

1.1 Forward Task (Molecule → Spectrum)

Computational approaches for predicting mass spectra from molecular structures

1.2 Inverse Task (Spectrum → Molecule)

AI methods for molecular identification and elucidation from mass spectra


2. Mass Spectrometry (Peptides)

2.1 Forward Task (Peptides → Spectrum)

Computational methods for predicting peptides mass spectra

2.2 Inverse Task (Spectrum → Peptides)

AI approaches for peptides identification and quantification


3. NMR Spectroscopy (Small Molecules)

3.1 Forward Task (Molecule → Spectrum)

Prediction of NMR spectra from molecular structures

📊 Forward Task Method Table

Paper Title & Link Method Type Data Source Performance Metric Notes
Prediction of chemical shift in NMR: A review Empirical - Rule-based Interpretable, less generalizable
iShiftML: Highly Accurate Prediction of NMR Chemical Shifts Hybrid ML + QM QM descriptors < 0.2 ppm error Fast inference, needs QM feature prep
NMR shift prediction from small data quantities ML NMRShiftDB2 MAE (ppm) Good scalability
NMR-spectrum prediction for dynamic molecules ML-Dynamics Simulated ensembles Time-avg ppm Accounts for flexible molecules
Machine learning in NMR spectroscopy DL NMRShiftDB2 TBD Multitask joint learning

3.2 Inverse Task: Spectrum → Molecule

📊 Inverse Task Method Table

Paper Title & Link Method Type Input Data Accuracy / Metric Notes
A Bayesian approach to structural elucidation using crystalline-state solid‑state NMR and probabilistic inference (2019) Bayesian Solid‑state NMR Top‑5 accuracy Requires crystal information
Accurate and efficient structure elucidation from routine one‑dimensional NMR spectra using multitask machine learning (2024) DL (CNN + Transformer) 1D spectra Top‑1 ~70% No need for 2D spectra
Deep reinforcement learning and graph convolutional networks for molecular inverse problem of NMR (2022) RL (MCTS + GCN) Shift table Top‑3 ~80% Effective for small molecules
High‑resolution iterative Full Spin Analysis (HiFSA) for small molecules using PERCH (2015) Spectral ID Simulated spectra Useful for detailed peak assignment
Automated mixture component identification via wavelet packet transform and optimization (2023) Mixture ID (WPT + Optimization) Mixtures Component-level accuracy Robust for complex sample spectra

🧬 NMR Dataset Comparison Table

Dataset Name & Link Spectrum Count Real / Simulated Multi-modal Spectra Labeled Downloadable / Crawlable
NMRShiftDB2 ~50,000 Real ¹H, ¹³C ✅ Yes ✅ Yes (open source)
BMRB >13,000 biomolecules Real ¹H, ¹³C, ¹⁵N, ²H, ³¹P ✅ Yes ✅ Yes (FTP/STAR)
SDBS ~14,000 Real ¹H, ¹³C, IR, MS, UV ✅ Yes ✅ Yes (Crawl Script Needed)
QM9-NMR (Simulated) 130,000+ Simulated (DFT) ¹H, ¹³C ✅ Yes ✅ Yes (via DOI or GitHub)
2DNMRGym (2024) 22,000 2D HSQC Simulated HSQC (2D) ✅ Yes ✅ Yes (HuggingFace)
NMRMixDB ~3,000 mixtures Real ¹H ✅ Yes (with labels) ✅ Yes
NMRPredBench ~3,000 Real + Simulated ¹H, ¹³C ✅ Yes ✅ Yes (GitHub)
MolAid ~840K+ Experimental Multi-property ✅ Yes ❌ No(API Chared)
NIST WebBook ~700K+ Experimental ¹H, ¹³C etc. ✅ Yes ✅ Yes (Need Search Key)
PubChem ~100M+ Experimental + Predicted Full compound attributes ✅ Yes ✅ Yes (API)

4. IR Spectroscopy (Small Molecules)

4.1 Forward Task (Molecule → Spectrum)

Infrared spectrum prediction from molecular structures

4.2 Inverse Task (Spectrum → Molecule)

Molecular characterization from infrared spectra


5. Multimodal Spectroscopy (Small Molecules)

5.1 Forward Task (Molecule → Multiple Spectra)

Joint prediction of multiple spectral modalities from molecular structures

5.2 Inverse Task (Multiple Spectra → Molecule)

Multimodal integration for enhanced molecular identification


6. X-ray Diffraction (XRD) (Crystals)

6.1 Forward Task (Crystal → Pattern)

Prediction of XRD patterns from crystal structures

6.2 Inverse Task (Pattern → Crystal)

Crystal structure determination from XRD patterns


License

📄 This project is licensed under the MIT License — see the LICENSE file for details.


About

✨✨ Latest Advances on AI for Spectra Data Analysis (SpectraAI) in MS&mol

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors