
Make training code available? #2

Open
JohnnyC08 opened this issue Apr 27, 2021 · 9 comments

@JohnnyC08

I'm interested in reproducing the model in PyTorch and am curious how you preprocessed the data and trained it. I didn't see any metrics reported and would like to see what those look like too, so the training script would be nice to have as well!

Great repo by the way!

@creatorrr

+1 @JohnnyC08

It'd also be interesting to see if including context (conversation history / last utterances) improves the accuracy of predictions.

@JohnnyC08

@creatorrr That's interesting.

How would you go about doing that? My first thought is to use a rolling window, concatenate the utterances in the window into a single block of text, and assign the block the label of the last utterance in the window.

How would you do it?
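For illustration, here's a minimal sketch of that rolling-window idea; the `utterances`, `labels`, and `window_size` names are placeholders, not anything from this repo:

```python
# Concatenate the last `window_size` utterances into one text block and
# assign it the dialog-act label of the final utterance in the window.
def make_windows(utterances, labels, window_size=3):
    examples = []
    for i in range(len(utterances)):
        start = max(0, i - window_size + 1)
        block = " ".join(utterances[start:i + 1])
        examples.append((block, labels[i]))  # label of the last utterance
    return examples

utterances = ["Do you want to grab lunch?", "Not really.", "Oh okay.", "How about tomorrow?"]
labels = ["Yes-No-Question", "Dispreferred-answers", "Response-Acknowledgement",
          "Open-Question"]  # last label is made up for the example
print(make_windows(utterances, labels))
```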

@creatorrr

@JohnnyC08 I was thinking of something simpler: prepending the dialog-act labels of the last three utterances to the input when fine-tuning. For example, take this conversation:

A: Do you want to grab lunch? [Yes-No-Question]
B: Not really. [Dispreferred-answers]
A: Oh okay. [Response-Acknowledgement]
B: How about tomorrow? <<TO PREDICT>>

Then the input vector would be:
[CLS] Yes-No-Question [SEP] Dispreferred-answers [SEP] Response-Acknowledgement [SEP] How about tomorrow? [SEP]
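A minimal sketch of how that input could be assembled with a Hugging Face tokenizer; the `distilbert-base-uncased` checkpoint is only an assumption for illustration:

```python
# Prepend the dialog-act labels of the previous utterances to the current
# utterance before tokenizing; the tokenizer adds [CLS] and the final [SEP].
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("distilbert-base-uncased")

prev_labels = ["Yes-No-Question", "Dispreferred-answers", "Response-Acknowledgement"]
utterance = "How about tomorrow?"

# Join the history labels with [SEP] and pass the utterance as the second segment.
context = f" {tokenizer.sep_token} ".join(prev_labels)
encoded = tokenizer(context, utterance, return_tensors="pt")

# Inspect the assembled sequence.
print(tokenizer.decode(encoded["input_ids"][0]))
```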

@argideritzalpea

@creatorrr @JohnnyC08 Did either of you end up creating a previous-context-dependent model? Also, were you able to successfully predict on a GPU? Loading the model allocates all of my card's memory, which suggests a leak when loading the downloaded model.
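One hedged guess: if the released model is TensorFlow/Keras-based (an assumption on my part), this may not be a leak at all, since TensorFlow reserves all GPU memory by default. Enabling memory growth before loading avoids that:

```python
# Assumes a TensorFlow-backed model; TF grabs all GPU memory up front unless
# memory growth is enabled before the model is loaded.
import tensorflow as tf

for gpu in tf.config.list_physical_devices("GPU"):
    tf.config.experimental.set_memory_growth(gpu, True)

# ...then load the model as usual.
```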

@argideritzalpea

argideritzalpea commented Dec 18, 2021

@bhavitvyamalik Thanks again for publishing the model. I think that some comments on how training was conducted would really make this repo more complete.

- What are the inputs for training on the SBWA corpus? Are they single sentences or sequences of sentences?
- What training scripts were used to train this model?
- Are there any utilities to customize this for another dataset?
- What parameters were used for fine-tuning?
- Which outputs of the DistilBERT encoding are used for the classification task?

I am attempting to use this for DA labeling on a conversational dataset, and it gives varying and poor results for the same simple sentence "Okay." I assume this is because of dropout and over- or under-fitting. Overall, I'm not sure this model gives me the confidence required to use it for my project as is. If the training scripts and the data were released, that would be awesome!
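As a sanity check on the dropout theory, here is a generic sketch (the checkpoint id is a placeholder, not this repo's actual model or loading API): with the model in eval mode, dropout is disabled, so repeated predictions on the same input should be identical; if they still vary, the cause is something else.

```python
# Run the same input several times with dropout disabled and compare outputs.
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

ckpt = "your-org/distilbert-dialog-acts"  # hypothetical placeholder id
tokenizer = AutoTokenizer.from_pretrained(ckpt)
model = AutoModelForSequenceClassification.from_pretrained(ckpt)
model.eval()  # disables dropout, so inference should be deterministic

inputs = tokenizer("Okay.", return_tensors="pt")
with torch.no_grad():
    for _ in range(3):
        logits = model(**inputs).logits
        print(logits.argmax(-1).item())  # should print the same label id each time
```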

@creatorrr

creatorrr commented Jan 19, 2022

> @creatorrr @JohnnyC08 Did either of you end up creating a previous-context-dependent model? Also, were you able to successfully predict on a GPU? Loading the model allocates all of my card's memory, which suggests a leak when loading the downloaded model.

Haven’t gotten around to it yet, been really busy but will give it a try one of these weekends @argideritzalpea

@creatorrr

> @creatorrr That's interesting.
>
> How would you go about doing that? My first thought is to use a rolling window, concatenate the utterances in the window into a single block of text, and assign the block the label of the last utterance in the window.
>
> How would you do it?

Ever get a chance to try this out, @JohnnyC08?

@hannan72

hannan72 commented Aug 21, 2022

Hi,
Could you please share the training scripts?
Also could you please share the link to the training data?

@creatorrr

@JohnnyC08 I ended up training a DeBERTa-based dialog act classifier on the silicone-merged dataset using sentence pairs (previous utterance, current utterance), and it performs better than with single utterances. You can take a look here.
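For anyone wanting to try it, a sketch of sentence-pair inference with the generic transformers API; the checkpoint id below is a placeholder, so substitute the model linked above:

```python
# Feed (previous utterance, current utterance) as a sentence pair to the classifier.
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

ckpt = "your-org/deberta-dialog-acts-pair"  # placeholder, not the actual model id
tokenizer = AutoTokenizer.from_pretrained(ckpt)
model = AutoModelForSequenceClassification.from_pretrained(ckpt)
model.eval()

prev, curr = "Not really.", "How about tomorrow?"
inputs = tokenizer(prev, curr, return_tensors="pt")
with torch.no_grad():
    pred = model(**inputs).logits.argmax(-1).item()
print(model.config.id2label[pred])
```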
