Upload images to generate JSONL training data for multimodal fine‑tuning
中文 | EnglishThis project generates JSONL training data from images, with a simple frontend UI.
output.compress-video-online.com.2.mp4
Linux/macOS:
cd backend
bash start.shWindows:
cd backend
start.bat- Q: Which image formats are supported?
- A: JPG, PNG, WebP. Recommended size < 10MB per image.
- Q: How to process a large batch of images?
- A: You can upload multiple images at once; recommend ≤ 50 per batch.
- Q: Where are the generated files stored?
- A: In
backend/outputs/on the server, and they are auto‑downloaded to your computer.
- A: In
- Q: Can I customize the output data format?
- A: Yes. Modify
create_training_datainbackend/app.py.
- A: Yes. Modify
Issues and PRs are welcome for feature improvements, bug fixes, and documentation.
Explore our community (Chinese): 👉 Tech Community | Fufan Space
