Raman-01: Pocket Physics Solver LLM

Raman-01 is a compact, RL-finetuned LLM tailored specifically for solving physics problems. Built to be lightweight and easily deployable, it excels at tackling easy-to-medium difficulty physics questions across diverse domains.

https://huggingface.co/think-a-tron/raman-01-1.7B

🚀 Model Overview

Base Model: Qwen3-1.7B (cold-start supervised fine-tuning)
- SFT Data: 1500 samples covering kinematics, electromagnetism, acoustics, and other fundamental physics domains.
- SFT Training: 3 epochs, achieving a loss reduction to ~0.3.
Reinforcement Learning (GRPO) Fine-tuning:
- Dataset: Single carefully-selected medium-difficulty physics sample. (1-shot RLVR)
- Training Steps: 70 GRPO steps
- Reward Progression: Improved from an initial reward of 0.1 to 0.8 by training completion.
- Methodology closely follows the training style pioneered by the DeepSeek-R1 model but on a smaller, targeted scale.

🎯 Use Cases

Ideal for quick, reliable physics problem-solving in:

Mobile or edge deployments requiring minimal computational resources.
Educational tools and pocket-sized physics assistants.
Rapid prototyping of physics-related AI applications.

📊 Performance & Benchmarks

Demonstrates strong performance on easy-to-medium difficulty physics problems, especially effective in fundamental physics domains such as mechanics, electromagnetism, and basic acoustics.
Evaluations recommended using PhyBench or similar physics-focused benchmarks for accuracy assessment.

⚙️ Deployment

Compact model size (1.7B parameters) optimized for rapid inference on CPU/GPU.
Easily deployable via popular frameworks (PyTorch, Hugging Face, vLLM).

📝 Limitations

Specialized on easy-medium physics problems; performance on complex, multi-domain or advanced theoretical questions may vary.
Trained primarily to demonstrate capabilities in a constrained setting; broader fine-tuning advised for production-grade deployments.

🔗 Citation

If you utilize Raman-01, please acknowledge accordingly:

@misc{raman01,
  author = {Sai Praneeth Diddigam},
  title = {Raman-01: Compact RL-Enhanced Physics Solver},
  year = {2025},
  note = {RL (GRPO) finetuned on Qwen3-1.7B}
}

Developed to be your go-to physics solver, Raman-01 packs powerful performance into a conveniently small footprint.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
README.md		README.md
dataset.py		dataset.py
grpo.py		grpo.py
pyproject.toml		pyproject.toml
sft.py		sft.py
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Raman-01: Pocket Physics Solver LLM

🚀 Model Overview

🎯 Use Cases

📊 Performance & Benchmarks

⚙️ Deployment

📝 Limitations

🔗 Citation

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Raman-01: Pocket Physics Solver LLM

🚀 Model Overview

🎯 Use Cases

📊 Performance & Benchmarks

⚙️ Deployment

📝 Limitations

🔗 Citation

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages