[WIP] project proposal #18

oserikov · 2018-11-01T14:42:56Z

The project on turkic phonetics and NNs interpretation.

no timeline, abstract, intro, references yet

oserikov · 2018-11-01T14:43:09Z

work in progress

ftyers · 2018-11-04T00:42:31Z

Good so far. I'd like to use Dynet as the NN backend if possible. Would that be ok?

oserikov · 2018-11-06T23:10:37Z

Yes, why not :) I also wanted to play with pyTorch, so if it'll fit and I'll have free time I'd reimplement Dynet part on pyTorch. BTW, why Dynet?

ftyers · 2018-11-06T23:12:33Z

I've heard DyNet trains faster on CPU compared to tensorflow/theano/etc. In addition, it's probably a bit easier to install, and doesn't require non-free software like CUDA. :)

oserikov · 2018-11-06T23:22:03Z

Oh, got it. It will be interesting to compare pyTorch and Dynet performance then! And finally CUDA seems to be non-open-source, but freware, so the available use cases are not really obvious for me :\

ftyers · 2018-11-07T14:05:40Z

Oh, got it. It will be interesting to compare pyTorch and Dynet performance then! And finally CUDA seems to be non-open-source, but freware, so the available use cases are not really obvious for me :\

Yes, that would be. Btw, when I say "non-free" I'm referring to free software as defined by the FSF (see here), I don't mean бесплатный :)

ftyers · 2018-11-07T14:08:36Z

2018-komp-ling/projects/serikov.md

+* Vizualization skill.
+
+#### Sub-goals
+* 1 week| Reproduce the dataset used in original paper


"The input to the network is a series of sequentially presented phonemes from a corpus of 602 Turkish words. "

This shouldn't take any time at all. I can provide you with the words.

this week the input data reproduction took ~3 days, and there still being some questions unanswered, so I think that weekly buffer to deal with the possible problems with the data collection could be helpful.

ftyers · 2018-11-07T14:09:04Z

2018-komp-ling/projects/serikov.md

+### EP requirements
+
+#### Sub-goals
+* 1 week| Collect the data to repeat the research on different languages data


For which languages do you have phonemes sequences? Just asking, it was interesting to know the best way to collect data like that.

oserikov · 2018-11-13T23:48:20Z

I plan to start working on the proj ~ 19 november -- i'll spend a couple of days setting up dependencies and reading guides, so ~21th november is a good day to start, isn't it? Following the timeline I should finish EP before the start of the 3rd module in HSE

ftyers · 2018-11-13T23:51:16Z

Great! Just let me know when you need some data. If it goes well, it could be an ACL short paper (deadline 4th March). :)

oserikov · 2018-11-20T01:44:57Z

Pretty much any of the Turkic languages, you can do something like:
$ cat apertium-tur//apertium-tur.tur.lexc  |\
 grep -v '^!' |\
 grep '[^<> ]\+:[^<> ]\+ $N[^PU]\|V$[^ ]\+ ;' | cut -f1 -d':' |\
 sort -Ru | head -1000
From apertium-tur.

That command looks to catch the lexemes, but aren't the NNs described on the paper waiting for phonemes, not characters?

ftyers · 2018-11-20T02:25:47Z

@oserikov She says (p.2): "However, this phenomenon of consonant harmony can clearly not be considered in this study, as the two allophones for these consonants are represented by the same phoneme in the input data." This suggests that she is using just the surface characters not phonemes. We should do the same.

[WIP] project proposal.

303c2fa

no timeline, abstract, intro, references yet

added time consumption info, fixed typos.

600df73

ftyers reviewed Nov 7, 2018

View reviewed changes

oserikov added 2 commits November 27, 2018 03:12

wip: introduction

f15f304

added timings for mvp

6806fcd

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[WIP] project proposal #18

[WIP] project proposal #18

Uh oh!

oserikov commented Nov 1, 2018

Uh oh!

oserikov commented Nov 1, 2018

Uh oh!

ftyers commented Nov 4, 2018

Uh oh!

oserikov commented Nov 6, 2018 •

edited

Loading

Uh oh!

ftyers commented Nov 6, 2018

Uh oh!

oserikov commented Nov 6, 2018

Uh oh!

ftyers commented Nov 7, 2018

Uh oh!

ftyers Nov 7, 2018

Uh oh!

oserikov Nov 23, 2018

Uh oh!

ftyers Nov 7, 2018

Uh oh!

oserikov Nov 7, 2018

Uh oh!

ftyers Nov 7, 2018

Uh oh!

oserikov commented Nov 13, 2018

Uh oh!

ftyers commented Nov 13, 2018 •

edited

Loading

Uh oh!

oserikov commented Nov 20, 2018

Uh oh!

ftyers commented Nov 20, 2018

Uh oh!

Uh oh!

[WIP] project proposal #18

Are you sure you want to change the base?

[WIP] project proposal #18

Uh oh!

Conversation

oserikov commented Nov 1, 2018

Uh oh!

oserikov commented Nov 1, 2018

Uh oh!

ftyers commented Nov 4, 2018

Uh oh!

oserikov commented Nov 6, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ftyers commented Nov 6, 2018

Uh oh!

oserikov commented Nov 6, 2018

Uh oh!

ftyers commented Nov 7, 2018

Uh oh!

ftyers Nov 7, 2018

Choose a reason for hiding this comment

Uh oh!

oserikov Nov 23, 2018

Choose a reason for hiding this comment

Uh oh!

ftyers Nov 7, 2018

Choose a reason for hiding this comment

Uh oh!

oserikov Nov 7, 2018

Choose a reason for hiding this comment

Uh oh!

ftyers Nov 7, 2018

Choose a reason for hiding this comment

Uh oh!

oserikov commented Nov 13, 2018

Uh oh!

ftyers commented Nov 13, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

oserikov commented Nov 20, 2018

Uh oh!

ftyers commented Nov 20, 2018

Uh oh!

Uh oh!

oserikov commented Nov 6, 2018 •

edited

Loading

ftyers commented Nov 13, 2018 •

edited

Loading