Skip to content

Commit 480103b

Browse files
committed
Refactor train.py with Enhanced Dataset Processing and Tokenization
- Completely rewrote training script with robust dataset handling - Added support for ShareGPT and Alpaca-style datasets - Implemented advanced formatting and tokenization functions - Enhanced debug logging and error handling - Simplified model preparation and training workflow - Updated chat template and tokenization strategies - Improved inference and model saving methods
1 parent 0435789 commit 480103b

File tree

1 file changed

+235
-137
lines changed

1 file changed

+235
-137
lines changed

0 commit comments

Comments
 (0)