Commit
add input_output_map and multi-vote (#199)
* modify prompt templete
* modify prompt generation
* add new prompt for generator
* add new prompt for generator
* add new prompt for generator to solve unexpected Json of Json
* delete duplication
* new mock function
* new prompt template
* new prompt template for dataset generator
* add random generted few-shot examples
* split into two prompt template
* new middle template
* add left padding
* add max_new_tokens
* add max_new_tokens
* add max_new_tokens and control for GPT2
* debug single generation for executor
* add new unit test for model executor
* add <eos> for processor
* add new test for trainer
* Refactor model executor
* [use max length]
* [use max length] strict beam search
* add more constraint on sequence length
* release an if control in trainer
* Add more device for model executor (#146)
* add device change for executor
* use cpu to inference the trainer
* use cuda to inference the trainer
* add new assertion
* add new assertion for metric training
* add logging for training
* Use tiny-gpt2 instead of full gpt-2 for tests
* reduce memory cost for trainer test
---------
Co-authored-by: zhaochen20 <[email protected]>
Co-authored-by: viswavi <[email protected]>
* use small models at all
* use small model for testing, fix format
* inference on cuda
* split test helper of model creator
* revert generator
* revert generator
* add warning for none constraints of sequence_max_length
* fix lint
* fix lint
* add test for trunction in trainer
* fix review
* add label to processor (#149)
* add device change for executor
* use cpu to inference the trainer
* use cuda to inference the trainer
* add new assertion
* add new assertion for metric training
* add logging for training
* add label to processor
---------
Co-authored-by: zhaochen20 <[email protected]>
* fix review
* fix review. change name
* fix review. change name
* add label unit test
* add comments for unit tests
* add comments for the magic number -100
* add delimiter for processor, add new test for tokenization of trainer
* fix typecheck for 3.9 and 3.10
* add , add new unit test for tokenize dataset and add comments.
* greedy decode
* greedy decode
* fix bug for eos_token_id
* add repetition_penalty
* add print to prediction
* new tests for autoregressive model
* new tests for autoregressive model
* fix comment
* fix typo
* fix assert, add new unit test
* add new test cases and real test for trainer
* delete confidence
* refactor unit tests
* add 5 generation strategy
* add 5 generation strategy
* fix OOM problem of trainer test
* fix OOM problem of trainer test
* fix OOM problem of trainer test
* delete unecessary eos
* delete unecessary eos
* refactor evaluator
* use sub set
* fix oom
* fix oom
* add comments for evaluator
* add new comments for the processor
* refactor model executor
* refactor training callback
* gc.collect()
* use gc.collect() to fix oom
* concatenate tests for trainer
* fix multi in unit tests
* Component document (#132)
* new readme
* change openAI key requirement
* add components usage
* add __future__
* new contribution.md
* new datasetgenerator.md
* new datasetgenerator.md
* new readme for current componets
* delete T5Trainer
* delete T5Trainer
* add new readme for pipeline
* new name
* add document for pipelne
* [Stash]
* transform new format
* add new docs
* add new docs
* Add model_max_length (#88)
* add new model_max_length
* add new unit test
* add new wanrning
* add annotation
---------
Co-authored-by: zhaochen20 <[email protected]>
* Support None Model Max Length (#92)
* add if branch of model_max_length
* add if branch of model_max_length
---------
Co-authored-by: zhaochen20 <[email protected]>
* add new readme for trainer and excutor
* Update README.md
* Update readme.md
* Update readme.md
* Update readme.md
* add new readme for demo creator
* Update readme.md
* Update readme.md
* Update readme.md
* Update readme.md
* Update readme.md
* Update readme.md
* Update readme.md
* Update readme.md
* Update readme.md
* Update readme.md
* Update readme.md
* Update readme.md
* Update readme.md
* Update readme.md
* Update readme.md
* readme for model evaluator
* lint
* add device change for executor, add new model trainer
* add device change for executor, add new model trainer
* add document for API KEY
* update dataset generator
* need to refactor model executor
* add new document for model executor
* add new document for refactored components
* add new document for trainer
* del trainer
* del trainer
* add new garbage cash
* add new garbage cash
---------
Co-authored-by: zhaochen20 <[email protected]>
Co-authored-by: Chenyang Zhao <[email protected]>
Co-authored-by: Chenyang Zhao <[email protected]>
* refactor generator
* refactor generator and add comments
* fix lint
* fix lint
* add new meta_prompt
* add new meta examples
* change temperature
* add new paramerters for generator
* fix lint
* fix lint
* add constraints to prompt too long
* use zeno in a batch
* add gc.collect()
* add error control
* merge test files for trainer
* add tests for t5
* add zeno's new release
* add multi-generation and debug
* [set base parameter]
* add requests_per_minute and responses_per_requests
* set max_length to 500
* delecte wrong max_new_tokens in executor
* distinguish high-equality and low-equality examples for generator
* merge main. Fix readme
* add new member variables
* add input_output_map and multi-vote
* change the labeling logics
* add truncation warning for executor
* use execption in dataset generator
* use execption in dataset generator
* merge main
* merge main
* update docstring
  Co-authored-by: Graham Neubig <[email protected]>
* fix review from graham
* Update tests/dataset_generator_with_filter_test.py
  Co-authored-by: Graham Neubig <[email protected]>
* use none stateful function
* add new member variables (#198)
* add new member variables
* update docstring
  Co-authored-by: Graham Neubig <[email protected]>
* fix review from graham
---------
Co-authored-by: zhaochen20 <[email protected]>
Co-authored-by: Graham Neubig <[email protected]>
* use none stateful function for extract responses
* add new comments
* fix grammar
* Update test_helpers/dataset_tools.py
  Co-authored-by: Graham Neubig <[email protected]>
* Update test_helpers/dataset_tools.py
  Co-authored-by: Graham Neubig <[email protected]>
* fix lint
* fix wrong comments
* Update prompt2model/dataset_generator/openai_gpt.py
  Co-authored-by: Graham Neubig <[email protected]>
* fix lint
* fix grammar error
---------
Co-authored-by: zhaochen20 <[email protected]>
Co-authored-by: viswavi <[email protected]>
Co-authored-by: Chenyang Zhao <[email protected]>
Co-authored-by: Chenyang Zhao <[email protected]>
Co-authored-by: Graham Neubig <[email protected]>