Commit 480103b

committed

Refactor train.py with Enhanced Dataset Processing and Tokenization

- Completely rewrote training script with robust dataset handling - Added support for ShareGPT and Alpaca-style datasets - Implemented advanced formatting and tokenization functions - Enhanced debug logging and error handling - Simplified model preparation and training workflow - Updated chat template and tokenization strategies - Improved inference and model saving methods

1 parent 0435789 commit 480103bCopy full SHA for 480103b

1 file changed

+235

-137

lines changed

praisonai
- train.py

1 file changed

+235

-137

lines changed

Comments

(0)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Commit 480103b

1 file changed

1 file changed

File tree

1 file changed

1 file changed

0 commit comments