pubs.html

---
layout: default
title: Publications -- Michael I Mandel
---
<div class="page-header">
<h1>Publications</h1>
</div>
<h2>Theses, Chapters</h2>
<table>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="mandel19">1</a>]
</td>
<td class="bibtexitem">
Michael Mandel, Justin Salamon, and Daniel&nbsp;P.W. Ellis, editors.
 <em>Proceedings of the Detection and Classification of Acoustic
  Scenes and Events 2019 Workshop (DCASE2019)</em>.
 New York University, NY, USA, October 2019.
[&nbsp;<a href="pubs_bib.html#mandel19">bib</a>&nbsp;| 
<a href="http://dx.doi.org/10.33682/1syg-dy60">DOI</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="mandel16e">2</a>]
</td>
<td class="bibtexitem">
Michael&nbsp;I Mandel, Shoko Araki, and Tomohiro Nakatani.
 Multichannel clustering and classification approaches.
 In Emmanuel Vincent, Tuomas Virtanen, and Sharon Gannot, editors,
  <em>Audio Source Separation and Speech Enhancement</em>, chapter&nbsp;12. Wiley,
  2018.
[&nbsp;<a href="pubs_bib.html#mandel16e">bib</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="MandelAndBarker2017">3</a>]
</td>
<td class="bibtexitem">
Michael&nbsp;I Mandel and Jon&nbsp;P Barker.
 Multichannel spatial clustering using model-based source separation.
 In Shinji Watanabe, Marc Delcroix, Florian Metze, and John&nbsp;R.
  Hershey, editors, <em>New Era for Robust Speech Recognition: Exploiting,
  Deep Learning</em>, chapter&nbsp;3. Springer, 2017.
[&nbsp;<a href="pubs_bib.html#MandelAndBarker2017">bib</a>&nbsp;| 
<a href="http://dx.doi.org/10.1007/978-3-319-64680-0">DOI</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="XiaoEtAl2017">4</a>]
</td>
<td class="bibtexitem">
Xiong Xiao, Shinji Watanabe, Hakan Erdogan, Michael Mandel, Liang Lu, John&nbsp;R.
  Hershey, Michael&nbsp;L. Seltzer, Guoguo Chen, Yu&nbsp;Zhang, and Dong Yu.
 Discriminative beamforming with phase-aware neural networks for
  speech enhancement and recognition.
 In Shinji Watanabe, Marc Delcroix, Florian Metze, and John&nbsp;R.
  Hershey, editors, <em>New Era for Robust Speech Recognition: Exploiting,
  Deep Learning</em>, chapter&nbsp;4. Springer, 2017.
[&nbsp;<a href="pubs_bib.html#XiaoEtAl2017">bib</a>&nbsp;| 
<a href="http://dx.doi.org/10.1007/978-3-319-64680-0">DOI</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="devaney16b">5</a>]
</td>
<td class="bibtexitem">
Johanna Devaney, Michael&nbsp;I Mandel, Douglas Turnbull, and George Tzanetakis,
  editors.
 <em>Proceedings of the 17th International Society for Music
  Information Retrieval Conference (ISMIR)</em>.
 New York, 2016.
[&nbsp;<a href="pubs_bib.html#devaney16b">bib</a>&nbsp;| 
<a href="https://drive.google.com/file/d/0B2SQvWn0_78BaWxUNEdyakROLWM/view?usp=sharing">http</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="bertin-mahieux09">6</a>]
</td>
<td class="bibtexitem">
Thierry Bertin-Mahieux, Douglas Eck, and Michael&nbsp;I. Mandel.
 Automatic tagging of audio: The state-of-the-art.
 In Wenwu Wang, editor, <em>Machine Audition: Principles, Algorithms
  and Systems</em>, chapter&nbsp;14, pages 334--352. IGI Publishing, 2010.
[&nbsp;<a href="pubs_bib.html#bertin-mahieux09">bib</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="mandel09c">7</a>]
</td>
<td class="bibtexitem">
Michael&nbsp;I. Mandel.
 <em>Binaural Model-Based Source Separation and Localization</em>.
 PhD thesis, Columbia University, February 2010.
[&nbsp;<a href="pubs_bib.html#mandel09c">bib</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/dissertation.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#mandel09c">Abstract</a>&nbsp;]

</td>
</tr>
</table>

<h2>Journal</h2>
<table>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="TrinhAndMandel2020b">1</a>]
</td>
<td class="bibtexitem">
Vieh&nbsp;Anh Trinh and Michael&nbsp;I Mandel.
 Directly comparing the listening strategies of humans and machines.
 <em>IEEE Transactions on Audio, Speech, and Language Processing</em>,
  29:312--323, 2021.
[&nbsp;<a href="pubs_bib.html#TrinhAndMandel2020b">bib</a>&nbsp;| 
<a href="http://dx.doi.org/10.1109/TASLP.2020.3040545">DOI</a>&nbsp;| 
<a href="pubs_abs.html#TrinhAndMandel2020b">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="mandelEtAl2019">2</a>]
</td>
<td class="bibtexitem">
Michael&nbsp;I Mandel, Vikas Grover, Mengxuan Zhao, Jiyoung Choi, and Valerie
  Shafer.
 The bubble-noise technique for speech perception research.
 <em>Perspectives of the ASHA Special Interest Groups</em>,
  4(6):1653--1666, 2019.
[&nbsp;<a href="pubs_bib.html#mandelEtAl2019">bib</a>&nbsp;| 
<a href="http://dx.doi.org/10.1044/2019_PERS-19-00058">DOI</a>&nbsp;| 
<a href="pubs_abs.html#mandelEtAl2019">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="mandel16c">3</a>]
</td>
<td class="bibtexitem">
Michael&nbsp;I Mandel, Sarah&nbsp;E Yoho, and Eric&nbsp;W Healy.
 Measuring time-frequency importance functions of speech with bubble
  noise.
 <em>Journal of the Acoustical Society of America</em>, 140:2542--2553,
  2016.
[&nbsp;<a href="pubs_bib.html#mandel16c">bib</a>&nbsp;| 
<a href="http://dx.doi.org/10.1121/1.4964102">DOI</a>&nbsp;| 
<a href="http://github.com/mim/auditoryBubbles">Code</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/jasa16.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#mandel16c">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="larochelle12">4</a>]
</td>
<td class="bibtexitem">
Hugo Larochelle, Michael&nbsp;I Mandel, Razvan Pascanu, and Yoshua Bengio.
 Learning algorithms for the classification restricted boltzmann
  machine.
 <em>Journal of Machine Learning Research</em>, 13:643--669, March 2012.
[&nbsp;<a href="pubs_bib.html#larochelle12">bib</a>&nbsp;| 
<a href="http://www.jmlr.org/papers/volume13/larochelle12a/larochelle12a.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#larochelle12">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="weiss11">5</a>]
</td>
<td class="bibtexitem">
Ron Weiss, Michael&nbsp;I. Mandel, and Daniel P.&nbsp;W. Ellis.
 Combining localization cues and source model constraints for binaural
  source separation.
 <em>Speech Communication</em>, 53(5):606--621, May 2011.
[&nbsp;<a href="pubs_bib.html#weiss11">bib</a>&nbsp;| 
<a href="http://dx.doi.org/10.1016/j.specom.2011.01.003">DOI</a>&nbsp;| 
<a href="http://www.ee.columbia.edu/~dpwe/pubs/WeissME11-messlev.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#weiss11">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="mandel11b">6</a>]
</td>
<td class="bibtexitem">
Michael&nbsp;I. Mandel, Razvan Pascanu, Douglas Eck, Yoshua Bengio, Luca&nbsp;M. Aiello,
  Rossano Schifanella, and Filippo Menczer.
 Contextual tag inference.
 <em>ACM Transactions on Multimedia Computing, Communications and
  Applications</em>, 7S(1):32:1--32:18, October 2011.
[&nbsp;<a href="pubs_bib.html#mandel11b">bib</a>&nbsp;| 
<a href="http://dx.doi.org/10.1145/2037676.2037689">DOI</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/tomccap11.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#mandel11b">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="devaney12a">7</a>]
</td>
<td class="bibtexitem">
Johanna Devaney, Michael&nbsp;I. Mandel, Daniel P.&nbsp;W. Ellis, and Ichiro Fujinaga.
 Automatically extracting performance data from recordings of trained
  singers.
 <em>Psychomusicology: Music, Mind &amp; Brain</em>, 21(1–-2):108--136,
  2012.
[&nbsp;<a href="pubs_bib.html#devaney12a">bib</a>&nbsp;| 
<a href="http://music.mcgill.ca/~devaney/files/devaney11automatically.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#devaney12a">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="mandel10a">8</a>]
</td>
<td class="bibtexitem">
Michael&nbsp;I. Mandel, Scott Bressler, Barbara Shinn-Cunningham, and Daniel P.&nbsp;W.
  Ellis.
 Evaluating source separation algorithms with reverberant speech.
 <em>IEEE Transactions on Audio, Speech, and Language Processing</em>,
  18(7):1872--1883, 2010.
[&nbsp;<a href="pubs_bib.html#mandel10a">bib</a>&nbsp;| 
<a href="http://dx.doi.org/10.1109/TASL.2010.2052252">DOI</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/taslp10b.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#mandel10a">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="mandel09a">9</a>]
</td>
<td class="bibtexitem">
Michael&nbsp;I. Mandel, Ron&nbsp;J. Weiss, and Daniel P.&nbsp;W. Ellis.
 Model-based expectation maximization source separation and
  localization.
 <em>IEEE Transactions on Audio, Speech, and Language Processing</em>,
  18(2):382--394, February 2010.
[&nbsp;<a href="pubs_bib.html#mandel09a">bib</a>&nbsp;| 
<a href="http://dx.doi.org/10.1109/TASL.2009.2029711">DOI</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/taslp10.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#mandel09a">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="mandel08b">10</a>]
</td>
<td class="bibtexitem">
Michael&nbsp;I. Mandel and Daniel P.&nbsp;W. Ellis.
 A web-based game for collecting music metadata.
 <em>Journal of New Music Research</em>, 37(2):151--165, 2008.
[&nbsp;<a href="pubs_bib.html#mandel08b">bib</a>&nbsp;| 
<a href="http://dx.doi.org/10.1080/09298210802479300">DOI</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/jnmr08.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#mandel08b">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="huang08">11</a>]
</td>
<td class="bibtexitem">
Thomas&nbsp;S. Huang, Charlie&nbsp;K. Dagli, Shyamsundar Rajaram, Edward&nbsp;Y. Chang,
  Michael&nbsp;I. Mandel, Graham&nbsp;E. Poliner, and Daniel P.&nbsp;W. Ellis.
 Active learning for interactive multimedia retrieval.
 <em>Proceedings of the IEEE</em>, 96(4):648--667, 2008.
[&nbsp;<a href="pubs_bib.html#huang08">bib</a>&nbsp;| 
<a href="http://dx.doi.org/10.1109/JPROC.2008.916364">DOI</a>&nbsp;| 
<a href="pubs_abs.html#huang08">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="mandel06b">12</a>]
</td>
<td class="bibtexitem">
Michael&nbsp;I. Mandel, Graham&nbsp;E. Poliner, and Daniel P.&nbsp;W. Ellis.
 Support vector machine active learning for music retrieval.
 <em>Multimedia systems</em>, 12(1):1--11, August 2006.
[&nbsp;<a href="pubs_bib.html#mandel06b">bib</a>&nbsp;| 
<a href="http://dx.doi.org/10.1007/s00530-006-0032-2">DOI</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/mmsj05.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#mandel06b">Abstract</a>&nbsp;]

</td>
</tr>
</table>

<h2>Conference</h2>
<table>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="CobanEtAl2023">1</a>]
</td>
<td class="bibtexitem">
Enis&nbsp;Berk &#x00c7;oban, Megan Perra, and Michael&nbsp;I Mandel.
 Towards high resolution weather monitoring with sound data.
 In <em>Proceedings of the IEEE International Conference on
  Acoustics, Speech, and Signal Processing</em>, 2024.
 To appear.
[&nbsp;<a href="pubs_bib.html#CobanEtAl2023">bib</a>&nbsp;| 
<a href="pubs_abs.html#CobanEtAl2023">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="SyedAndMandel2023">2</a>]
</td>
<td class="bibtexitem">
Ali&nbsp;Raza Syed and Michael&nbsp;I Mandel.
 Estimating shapley values of training utterances for automatic speech
  recognition models.
 In <em>Proceedings of the IEEE International Conference on
  Acoustics, Speech, and Signal Processing</em>, 2023.
[&nbsp;<a href="pubs_bib.html#SyedAndMandel2023">bib</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="TrinhEtAl2022">3</a>]
</td>
<td class="bibtexitem">
Viet&nbsp;Ahn Trinh, Hassan&nbsp;Salami Kavaki, and Michael&nbsp;I Mandel.
 Importantaug: a data augmentation agent for speech.
 In <em>Proceedings of the IEEE International Conference on
  Acoustics, Speech, and Signal Processing</em>, 2022.
[&nbsp;<a href="pubs_bib.html#TrinhEtAl2022">bib</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="CobanEtAl2022">4</a>]
</td>
<td class="bibtexitem">
Enis&nbsp;Berk &#x00c7;oban, Megan Perra, Dara Pir, and Michael&nbsp;I Mandel.
 Edansa-2019: The ecoacoustic dataset from arctic north slope alaska.
 In <em>Workshop on the Detection and Classification of Audio Scenes
  and Environments</em>, 2022.
[&nbsp;<a href="pubs_bib.html#CobanEtAl2022">bib</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="CobanEtAl2021">5</a>]
</td>
<td class="bibtexitem">
Enis&nbsp;Berk &#x00c7;oban, Ali&nbsp;R Syed, Dara Pir, and Michael&nbsp;I Mandel.
 Towards large scale ecoacoustic monitoring with small amounts of
  labeled data.
 In <em>IEEE Workshop on Applications of Signal Processing to Audio
  and Acoustics</em>, 2021.
[&nbsp;<a href="pubs_bib.html#CobanEtAl2021">bib</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="NiEtAl2020">6</a>]
</td>
<td class="bibtexitem">
Zhaoheng Ni, Yong Xu, Meng Yu, Bo&nbsp;Wu, Shixiong Zhang, Dong Yu, and Michael&nbsp;I
  Mandel.
 WPD++: an improved neural beamformer for simultaneous speech
  separation and dereverberation.
 In <em>IEEE Workshop on Spoken Language Technologies</em>, 2020.
[&nbsp;<a href="pubs_bib.html#NiEtAl2020">bib</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="KavakiAndMandel2020">7</a>]
</td>
<td class="bibtexitem">
Hassan&nbsp;Salami Kavaki and Michael&nbsp;I Mandel.
 Identifying important time-frequency locations in continuous speech
  utterances.
 In <em>Proceedings of Interspeech</em>, pages 1639--1643, 2020.
[&nbsp;<a href="pubs_bib.html#KavakiAndMandel2020">bib</a>&nbsp;| 
<a href="http://dx.doi.org/10.21437/Interspeech.2020-2637">DOI</a>&nbsp;| 
<a href="https://isca-speech.org/archive/Interspeech_2020/pdfs/2637.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#KavakiAndMandel2020">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="TrinhAndMandel2020">8</a>]
</td>
<td class="bibtexitem">
Viet&nbsp;Anh Trinh and Michael&nbsp;I. Mandel.
 Large scale evaluation of importance maps in automatic speech
  recognition.
 In <em>Proceedings of Interspeech</em>, pages 1166--1170, 2020.
[&nbsp;<a href="pubs_bib.html#TrinhAndMandel2020">bib</a>&nbsp;| 
<a href="http://dx.doi.org/10.21437/Interspeech.2020-2883">DOI</a>&nbsp;| 
<a href="https://www.isca-speech.org/archive/Interspeech_2020/pdfs/2883.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#TrinhAndMandel2020">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="GhalyAndMandel2020">9</a>]
</td>
<td class="bibtexitem">
Hussein Ghaly and Michael&nbsp;I Mandel.
 Using prosody to improve dependency parsing.
 In <em>Speech prosody</em>, 2020.
[&nbsp;<a href="pubs_bib.html#GhalyAndMandel2020">bib</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="CobanEtAl2020">10</a>]
</td>
<td class="bibtexitem">
Enis&nbsp;Berk &#x00c7;oban, Dara Pir, Richard So, and Michael&nbsp;I Mandel.
 Transfer learning from youtube soundtracks to tag arctic ecoacoustic
  recordings.
 In <em>Proceedings of the IEEE International Conference on
  Acoustics, Speech, and Signal Processing</em>, pages 726--730, 2020.
[&nbsp;<a href="pubs_bib.html#CobanEtAl2020">bib</a>&nbsp;| 
<a href="http://dx.doi.org/10.1109/ICASSP40776.2020.9053338">DOI</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/icassp20coban.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#CobanEtAl2020">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="MaitiAndMandel2019c">11</a>]
</td>
<td class="bibtexitem">
Soumi Maiti and Michael&nbsp;I Mandel.
 Speaker independence of neural vocoders and their effect on
  parametric resynthesis speech enhancement.
 In <em>Proceedings of the IEEE International Conference on
  Acoustics, Speech, and Signal Processing</em>, pages 206--210, 2020.
[&nbsp;<a href="pubs_bib.html#MaitiAndMandel2019c">bib</a>&nbsp;| 
<a href="http://dx.doi.org/10.1109/ICASSP40776.2020.9053296">DOI</a>&nbsp;| 
<a href="http://arxiv.org/abs/1911.06266">arXiv</a>&nbsp;| 
<a href="http://mr-pc.org/work/icassp20/">Demo</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/icassp20maitiSlides.pdf">Slides</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/icassp20maiti.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#MaitiAndMandel2019c">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="NiAndMandel2019">12</a>]
</td>
<td class="bibtexitem">
Zhaoheng Ni and Michael&nbsp;I Mandel.
 Mask-dependent phase estimation for monaural speaker separation.
 In <em>Proceedings of the IEEE International Conference on
  Acoustics, Speech, and Signal Processing</em>, 2020.
[&nbsp;<a href="pubs_bib.html#NiAndMandel2019">bib</a>&nbsp;| 
<a href="http://arxiv.org/abs/1911.02746">arXiv</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/icassp20ni.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#NiAndMandel2019">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="MaitiAndMandel2019b">13</a>]
</td>
<td class="bibtexitem">
Soumi Maiti and Michael&nbsp;I Mandel.
 Parametric resynthesis with neural vocoders.
 In <em>IEEE Workshop on Applications of Signal Processing to Audio
  and Acoustics</em>, pages 303--307, 2019.
[&nbsp;<a href="pubs_bib.html#MaitiAndMandel2019b">bib</a>&nbsp;| 
<a href="http://dx.doi.org/10.1109/WASPAA.2019.8937165">DOI</a>&nbsp;| 
<a href="http://arxiv.org/abs/1906.06762">arXiv</a>&nbsp;| 
<a href="http://mr-pc.org/work/waspaa19/">Demo</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/waspaa19.pdf">.pdf</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="MaitiAndMandel2019">14</a>]
</td>
<td class="bibtexitem">
Soumi Maiti and Michael&nbsp;I Mandel.
 Speech denoising by parametric resynthesis.
 In <em>Proceedings of the IEEE International Conference on
  Acoustics, Speech, and Signal Processing</em>, pages 6995--6999, 2019.
[&nbsp;<a href="pubs_bib.html#MaitiAndMandel2019">bib</a>&nbsp;| 
<a href="http://dx.doi.org/10.1109/ICASSP.2019.8683130">DOI</a>&nbsp;| 
<a href="http://mr-pc.org/work/icassp19/">Demo</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/icassp19poster.pdf">Poster</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/icassp19.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#MaitiAndMandel2019">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="TrinhEtAl2018">15</a>]
</td>
<td class="bibtexitem">
Viet&nbsp;Anh Trinh, Brian McFee, and Michael&nbsp;I Mandel.
 Bubble cooperative networks for identifying important speech cues.
 In <em>Proceedings of Interspeech</em>, pages 1616--1620, 2018.
[&nbsp;<a href="pubs_bib.html#TrinhEtAl2018">bib</a>&nbsp;| 
<a href="http://dx.doi.org/10.21437/Interspeech.2018-2377">DOI</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/interspeech18trinhPoster.pdf">Poster</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/interspeech18trinh.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#TrinhEtAl2018">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="SyedEtAl2018">16</a>]
</td>
<td class="bibtexitem">
Ali&nbsp;Raza Syed, Viet&nbsp;Anh Trinh, and Michael&nbsp;I. Mandel.
 Concatenative resynthesis with improved training signals for speech
  enhancement.
 In <em>Proceedings of Interspeech</em>, pages 1195--1199, 2018.
[&nbsp;<a href="pubs_bib.html#SyedEtAl2018">bib</a>&nbsp;| 
<a href="http://dx.doi.org/10.21437/Interspeech.2018-2439">DOI</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/interspeech18syedPoster.pdf">Poster</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/interspeech18syed.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#SyedEtAl2018">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="MaitiEtAl2018">17</a>]
</td>
<td class="bibtexitem">
Soumi Maiti, Joey Ching, and Michael&nbsp;I. Mandel.
 Large vocabulary concatenative resynthesis.
 In <em>Proceedings of Interspeech</em>, pages 1190--1194, 2018.
[&nbsp;<a href="pubs_bib.html#MaitiEtAl2018">bib</a>&nbsp;| 
<a href="http://dx.doi.org/10.21437/Interspeech.2018-2383">DOI</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/interspeech18maitiPoster.pdf">Poster</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/interspeech18maiti.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#MaitiEtAl2018">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="MaitiAndMandel2017">18</a>]
</td>
<td class="bibtexitem">
Soumi Maiti and Michael&nbsp;I Mandel.
 Concatenative resynthesis using twin networks.
 In <em>Proceedings of Interspeech</em>, pages 3647--3651, 2017.
[&nbsp;<a href="pubs_bib.html#MaitiAndMandel2017">bib</a>&nbsp;| 
<a href="http://dx.doi.org/10.21437/Interspeech.2017-1653">DOI</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/interspeech17.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#MaitiAndMandel2017">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="syed17">19</a>]
</td>
<td class="bibtexitem">
Ali Syed, Andrew Rosenberg, and Michael&nbsp;I Mandel.
 Active learning for low-resource speech recognition: Impact of
  selection size and language modeling data.
 In <em>Proceedings of the IEEE International Conference on
  Acoustics, Speech, and Signal Processing</em>, 2017.
[&nbsp;<a href="pubs_bib.html#syed17">bib</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/syed17.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#syed17">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="devaney17">20</a>]
</td>
<td class="bibtexitem">
Johanna Devaney and Michael&nbsp;I Mandel.
 An evaluation of score-informed methods for estimating fundamental
  frequency and power from polyphonic audio.
 In <em>Proceedings of the IEEE International Conference on
  Acoustics, Speech, and Signal Processing</em>, 2017.
[&nbsp;<a href="pubs_bib.html#devaney17">bib</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/devaney17.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#devaney17">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="mandel16b">21</a>]
</td>
<td class="bibtexitem">
Michael&nbsp;I Mandel and Jon&nbsp;P Barker.
 Multichannel spatial clustering for robust far-field automatic speech
  recognition in mismatched conditions.
 In <em>Proceedings of Interspeech</em>, pages 1991--1995, 2016.
[&nbsp;<a href="pubs_bib.html#mandel16b">bib</a>&nbsp;| 
<a href="http://dx.doi.org/10.21437/Interspeech.2016-1275">DOI</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/interspeech16bslides.pdf">Slides</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/interspeech16b.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#mandel16b">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="mandel16">22</a>]
</td>
<td class="bibtexitem">
Michael&nbsp;I Mandel.
 Directly comparing the listening strategies of humans and machines.
 In <em>Proceedings of Interspeech</em>, pages 660--664, 2016.
[&nbsp;<a href="pubs_bib.html#mandel16">bib</a>&nbsp;| 
<a href="http://dx.doi.org/10.21437/Interspeech.2016-932">DOI</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/interspeech16poster.pdf">Poster</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/interspeech16.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#mandel16">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="erdogan16">23</a>]
</td>
<td class="bibtexitem">
Hakan Erdogan, John Hershey, Shinji Watanabe, Michael&nbsp;I Mandel, and Jonathan&nbsp;Le
  Roux.
 Improved MVDR beamforming using single-channel mask prediction
  networks.
 In <em>Proceedings of Interspeech</em>, pages 1981--1985, 2016.
[&nbsp;<a href="pubs_bib.html#erdogan16">bib</a>&nbsp;| 
<a href="http://dx.doi.org/10.21437/Interspeech.2016-552">DOI</a>&nbsp;| 
<a href="http://www.isca-speech.org/archive/Interspeech_2016/pdfs/0552.PDF">.PDF</a>&nbsp;| 
<a href="pubs_abs.html#erdogan16">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="xiao16">24</a>]
</td>
<td class="bibtexitem">
Xiong Xiao, Shinji Watanabe, Hakan Erdogan, Liang Lu, John Hershey, Michael&nbsp;L
  Seltzer, Guoguo Chen, Yu&nbsp;Zhang, Michael Mandel, and Dong Yu.
 Deep beamforming networks for multi-channel speech recognition.
 In <em>Proceedings of the IEEE International Conference on
  Acoustics, Speech, and Signal Processing</em>, pages 5745--5749. IEEE, mar 2016.
[&nbsp;<a href="pubs_bib.html#xiao16">bib</a>&nbsp;| 
<a href="http://dx.doi.org/10.1109/ICASSP.2016.7472778">DOI</a>&nbsp;| 
<a href="http://www.clsp.jhu.edu/~guoguo/papers/icassp2016_deep_beamforming.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#xiao16">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="bagchi15">25</a>]
</td>
<td class="bibtexitem">
Deblin Bagchi, Michael&nbsp;I Mandel, Zhongqiu Wang, Yanzhang He, Andrew Plummer,
  and Eric Fosler-Lussier.
 Combining spectral feature mapping and multi-channel model-based
  source separation for noise-robust automatic speech recognition.
 In <em>Proceedings of the IEEE Workshop on Automatic Speech
  Recognition and Understanding</em>, pages 496--503, 2015.
[&nbsp;<a href="pubs_bib.html#bagchi15">bib</a>&nbsp;| 
<a href="http://dx.doi.org/10.1109/ASRU.2015.7404836">DOI</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/asru15.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#bagchi15">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="tirumala15">26</a>]
</td>
<td class="bibtexitem">
Sreyas&nbsp;Srimath Tirumala and Michael&nbsp;I Mandel.
 Exciting estimated clean spectra for speech resynthesis.
 In <em>IEEE Workshop on Applications of Signal Processing to Audio
  and Acoustics</em>, 2015.
[&nbsp;<a href="pubs_bib.html#tirumala15">bib</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/waspaa15bposter.pdf">Poster</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/waspaa15b.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#tirumala15">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="mandel15d">27</a>]
</td>
<td class="bibtexitem">
Michael&nbsp;I Mandel and Young&nbsp;Suk Cho.
 Audio super-resolution using concatenative resynthesis.
 In <em>IEEE Workshop on Applications of Signal Processing to Audio
  and Acoustics</em>, 2015.
[&nbsp;<a href="pubs_bib.html#mandel15d">bib</a>&nbsp;| 
<a href="http://mr-pc.org/work/waspaa15/">Demo</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/waspaa15slides.pdf">Slides</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/waspaa15.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#mandel15d">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="mandel15c">28</a>]
</td>
<td class="bibtexitem">
Michael&nbsp;I Mandel and Nicoleta Roman.
 Enforcing consistency in spectral masks using markov random fields.
 In <em>Proceedings of EUSIPCO</em>, pages 2028--2032, 2015.
[&nbsp;<a href="pubs_bib.html#mandel15c">bib</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/eusipco15.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#mandel15c">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="mandel14c">29</a>]
</td>
<td class="bibtexitem">
Michael&nbsp;I Mandel, Young-Suk Cho, and Yuxuan Wang.
 Learning a concatenative resynthesis system for noise suppression.
 In <em>Proceedings of the IEEE GlobalSIP conference</em>, 2014.
[&nbsp;<a href="pubs_bib.html#mandel14c">bib</a>&nbsp;| 
<a href="http://mr-pc.org/work/globalsip14/">Demo</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/globalsip14poster.pdf">Poster</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/globalsip14.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#mandel14c">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="mandel14b">30</a>]
</td>
<td class="bibtexitem">
Michael&nbsp;I Mandel, Sarah&nbsp;E Yoho, and Eric&nbsp;W Healy.
 Generalizing time-frequency importance functions across noises,
  talkers, and phonemes.
 In <em>Proceedings of Interspeech</em>, 2014.
[&nbsp;<a href="pubs_bib.html#mandel14b">bib</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/interspeech14poster.pdf">Poster</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/interspeech14.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#mandel14b">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="mandel14a">31</a>]
</td>
<td class="bibtexitem">
Michael&nbsp;I Mandel and Arun Narayanan.
 Analysis-by-synthesis feature estimation for robust automatic speech
  recognition using spectral masks.
 In <em>Proceedings of the IEEE International Conference on
  Acoustics, Speech, and Signal Processing</em>, 2014.
[&nbsp;<a href="pubs_bib.html#mandel14a">bib</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/icassp14poster.pdf">Poster</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/icassp14a.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#mandel14a">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="nandi14">32</a>]
</td>
<td class="bibtexitem">
Arnab Nandi, Lilong Jiang, and Michael&nbsp;I Mandel.
 Gestural query specification.
 In <em>Proceedings of the International Conference on Very Large
  Data Bases</em>, volume&nbsp;7, 2014.
[&nbsp;<a href="pubs_bib.html#nandi14">bib</a>&nbsp;| 
<a href="https://speakerdeck.com/arnabdotorg/gestural-query-specification-querying-without-keyboards">Slides</a>&nbsp;| 
<a href="http://www.vldb.org/pvldb/vol7/p289-nandi.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#nandi14">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="mandel13">33</a>]
</td>
<td class="bibtexitem">
Michael&nbsp;I. Mandel.
 Learning an intelligibility map of individual utterances.
 In <em>IEEE Workshop on Applications of Signal Processing to Audio
  and Acoustics</em>, 2013.
[&nbsp;<a href="pubs_bib.html#mandel13">bib</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/waspaa13.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#mandel13">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="roman13">34</a>]
</td>
<td class="bibtexitem">
Nicoleta Roman and Micheal Mandel.
 Classification based binaural dereverberation.
 In <em>Proceedings of Interspeech</em>, 2013.
[&nbsp;<a href="pubs_bib.html#roman13">bib</a>&nbsp;| 
<a href="pubs_abs.html#roman13">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="devaney12b">35</a>]
</td>
<td class="bibtexitem">
Johanna Devaney, Michael&nbsp;I. Mandel, and Ichiro Fujinaga.
 A study of intonation in three-part singing using the automatic music
  performance analysis and comparison toolkit (AMPACT).
 In <em>Proceedings of the International Society for Music
  Information Retrieval conference</em>, 2012.
[&nbsp;<a href="pubs_bib.html#devaney12b">bib</a>&nbsp;| 
<a href="http://ismir2012.ismir.net/event/papers/511-ismir-2012.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#devaney12b">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="devaney11">36</a>]
</td>
<td class="bibtexitem">
Johanna Devaney, Michael&nbsp;I. Mandel, and Ichiro Fujinaga.
 Characterizing singing voice fundamental frequency trajectories.
 In <em>IEEE Workshop on Applications of Signal Processing to Audio
  and Acoustics</em>, pages 73--76, October 2011.
[&nbsp;<a href="pubs_bib.html#devaney11">bib</a>&nbsp;| 
<a href="http://music.mcgill.ca/~devaney/files/devaney11waspaaPoster.pdf">Poster</a>&nbsp;| 
<a href="http://www.music.mcgill.ca/~devaney/files/devaney11waspaa.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#devaney11">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="mandel10b">37</a>]
</td>
<td class="bibtexitem">
Michael&nbsp;I. Mandel, Douglas Eck, and Yoshua Bengio.
 Learning tags that vary within a song.
 In <em>Proceedings of the International Society for Music
  Information Retrieval conference</em>, pages 399--404, August 2010.
[&nbsp;<a href="pubs_bib.html#mandel10b">bib</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/ismir10slides.pdf">Slides</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/ismir10.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#mandel10b">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="bergstra10">38</a>]
</td>
<td class="bibtexitem">
James Bergstra, Michael&nbsp;I. Mandel, and Douglas Eck.
 Scalable genre and tag prediction with spectral covariance.
 In <em>Proceedings of the International Society for Music
  Information Retrieval conference</em>, pages 507--512, August 2010.
[&nbsp;<a href="pubs_bib.html#bergstra10">bib</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/ismir10b.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#bergstra10">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="mandel09b">39</a>]
</td>
<td class="bibtexitem">
Michael&nbsp;I. Mandel and Daniel P.&nbsp;W. Ellis.
 The ideal interaural parameter mask: a bound on binaural separation
  systems.
 In <em>IEEE Workshop on Applications of Signal Processing to Audio
  and Acoustics</em>, pages 85--88, October 2009.
[&nbsp;<a href="pubs_bib.html#mandel09b">bib</a>&nbsp;| 
<a href="http://dx.doi.org/10.1109/ASPAA.2009.5346506">DOI</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/waspaa09poster.pdf">Poster</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/waspaa09.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#mandel09b">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="devaney09">40</a>]
</td>
<td class="bibtexitem">
Johanna Devaney, Michael&nbsp;I. Mandel, and Daniel P.&nbsp;W. Ellis.
 Improving MIDI-audio alignment with acoustic features.
 In <em>IEEE Workshop on Applications of Signal Processing to Audio
  and Acoustics</em>, pages 45--48, October 2009.
[&nbsp;<a href="pubs_bib.html#devaney09">bib</a>&nbsp;| 
<a href="http://dx.doi.org/10.1109/ASPAA.2009.5346500">DOI</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/devaney_waspaa09.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#devaney09">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="law09">41</a>]
</td>
<td class="bibtexitem">
Edith Law, Kris West, Michael&nbsp;I Mandel, Mert Bay, and J.&nbsp;Stephen Downie.
 Evaluation of algorithms using games: the case of music annotation.
 In <em>Proceedings of the International Society for Music
  Information Retrieval conference</em>, pages 387--392, October 2009.
[&nbsp;<a href="pubs_bib.html#law09">bib</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/ismir09.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#law09">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="weiss08">42</a>]
</td>
<td class="bibtexitem">
Ron&nbsp;J. Weiss, Michael&nbsp;I. Mandel, and Daniel P.&nbsp;W. Ellis.
 Source separation based on binaural cues and source model
  constraints.
 In <em>Proceedings of Interspeech</em>, pages 419--422, September 2008.
[&nbsp;<a href="pubs_bib.html#weiss08">bib</a>&nbsp;| 
<a href="http://www.isca-speech.org/archive/interspeech_2008/i08_0419.html">Demo</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/interspeech08.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#weiss08">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="mandel08a">43</a>]
</td>
<td class="bibtexitem">
Michael&nbsp;I. Mandel and Daniel P.&nbsp;W. Ellis.
 Multiple-instance learning for music information retrieval.
 In <em>Proceedings of the International Society for Music
  Information Retrieval conference</em>, pages 577--582, September 2008.
[&nbsp;<a href="pubs_bib.html#mandel08a">bib</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/ismir08poster.pdf">Poster</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/ismir08.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#mandel08a">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="ellis08">44</a>]
</td>
<td class="bibtexitem">
Daniel P.&nbsp;W. Ellis, Courtenay&nbsp;V. Cotton, and Michael&nbsp;I. Mandel.
 Cross-correlation of beat-synchronous representations for music
  similarity.
 In <em>Proceedings of the IEEE International Conference on
  Acoustics, Speech, and Signal Processing</em>, pages 57--60, April 2008.
[&nbsp;<a href="pubs_bib.html#ellis08">bib</a>&nbsp;| 
<a href="http://dx.doi.org/10.1109/ICASSP.2008.4517545">DOI</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/icassp08.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#ellis08">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="mandel07c">45</a>]
</td>
<td class="bibtexitem">
Michael&nbsp;I. Mandel and Daniel P.&nbsp;W. Ellis.
 EM localization and separation using interaural level and phase
  cues.
 In <em>IEEE Workshop on Applications of Signal Processing to Audio
  and Acoustics</em>, pages 275--278, October 2007.
[&nbsp;<a href="pubs_bib.html#mandel07c">bib</a>&nbsp;| 
<a href="http://dx.doi.org/10.1109/10.1109/ASPAA.2007.4392987">DOI</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/waspaa07poster.pdf">Poster</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/waspaa07.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#mandel07c">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="mandel07b">46</a>]
</td>
<td class="bibtexitem">
Michael&nbsp;I. Mandel and Daniel P.&nbsp;W. Ellis.
 A web-based game for collecting music metadata.
 In Simon Dixon, David Bainbridge, and Rainer Typke, editors, <em>
  Proceedings of the International Society for Music Information Retrieval
  conference</em>, pages 365--366, September 2007.
[&nbsp;<a href="pubs_bib.html#mandel07b">bib</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/ismir07poster.pdf">Poster</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/ismir07.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#mandel07b">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="mandel07a">47</a>]
</td>
<td class="bibtexitem">
Michael&nbsp;I. Mandel, Daniel P.&nbsp;W. Ellis, and Tony Jebara.
 An EM algorithm for localizing multiple sound sources in
  reverberant environments.
 In B.&nbsp;Sch&ouml;lkopf, J.&nbsp;Platt, and T.&nbsp;Hoffman, editors, <em>Advances
  in Neural Information Processing Systems</em>, pages 953--960. MIT Press,
  Cambridge, MA, 2007.
[&nbsp;<a href="pubs_bib.html#mandel07a">bib</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/nips06poster.pdf">Poster</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/nips06.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#mandel07a">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="mandel05">48</a>]
</td>
<td class="bibtexitem">
Michael&nbsp;I. Mandel and Daniel P.&nbsp;W. Ellis.
 Song-level features and support vector machines for music
  classification.
 In Joshua&nbsp;D. Reiss and Geraint&nbsp;A. Wiggins, editors, <em>Proceedings
  of the International Society for Music Information Retrieval conference</em>,
  pages 594--599, September 2005.
[&nbsp;<a href="pubs_bib.html#mandel05">bib</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/ismir05poster.pdf">Poster</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/ismir05.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#mandel05">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="sudderth05">49</a>]
</td>
<td class="bibtexitem">
Erik&nbsp;B. Sudderth, Michael&nbsp;I. Mandel, William&nbsp;T. Freeman, and Alan&nbsp;S. Willsky.
 Distributed occlusion reasoning for tracking with nonparametric
  belief propagation.
 In Lawrence&nbsp;K. Saul, Yair Weiss, and L&eacute;on Bottou, editors, <em>
  Advances in Neural Information Processing Systems</em>, pages 1369--1376. MIT
  Press, Cambridge, MA, 2005.
[&nbsp;<a href="pubs_bib.html#sudderth05">bib</a>&nbsp;| 
<a href="http://ssg.mit.edu/nbp/">Demo</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/nips04.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#sudderth05">Abstract</a>&nbsp;]

</td>
</tr>
</table>

<h2>Other</h2>
<table>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="DavolEtAl2021">1</a>]
</td>
<td class="bibtexitem">
Eleanor Davol, Natalie Boelman, Todd Brinkman, Carissa Brown, Glen Liston,
  Michael Mandel, Enis Coban, Megan Perra, Kirsten Reid, Scott Leorna, et&nbsp;al.
 Automated soundscape analysis reveals strong influence of time since
  wildfire on boreal breeding birds.
 In <em>AGU Fall Meeting Abstracts</em>, volume 2021, pages B23C--03,
  2021.
[&nbsp;<a href="pubs_bib.html#DavolEtAl2021">bib</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="NiEtAl2020b">2</a>]
</td>
<td class="bibtexitem">
Zhaoheng Ni, Felix Grezes, Viet&nbsp;Anh Trinh, and Michael&nbsp;I Mandel.
 Improved MVDR beamforming using LSTM speech models to clean
  spatial clustering masks, 2020.
[&nbsp;<a href="pubs_bib.html#NiEtAl2020b">bib</a>&nbsp;| 
<a href="http://arxiv.org/abs/2012.02191">arXiv</a>&nbsp;| 
<a href="https://arxiv.org/pdf/2012.02191.pdf">.pdf</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="GrezesEtAl2020">3</a>]
</td>
<td class="bibtexitem">
Felix Grezes, Zhaoheng Ni, Viet&nbsp;Anh Trinh, and Michael Mandel.
 Enhancement of spatial clustering-based time-frequency masks using
  lstm neural networks, 2020.
[&nbsp;<a href="pubs_bib.html#GrezesEtAl2020">bib</a>&nbsp;| 
<a href="http://arxiv.org/abs/2012.01576">arXiv</a>&nbsp;| 
<a href="https://arxiv.org/pdf/2012.01576.pdf">.pdf</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="GrezesEtAl2020b">4</a>]
</td>
<td class="bibtexitem">
Felix Grezes, Zhaoheng Ni, Viet&nbsp;Anh Trinh, and Michael Mandel.
 Combining spatial clustering with lstm speech models for multichannel
  speech enhancement, 2020.
[&nbsp;<a href="pubs_bib.html#GrezesEtAl2020b">bib</a>&nbsp;| 
<a href="http://arxiv.org/abs/2012.03388">arXiv</a>&nbsp;| 
<a href="https://arxiv.org/pdf/2012.03388.pdf">.pdf</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="CaiEtAl2020">5</a>]
</td>
<td class="bibtexitem">
Tian Cai, Michael&nbsp;I Mandel, and Di&nbsp;He.
 Music autotagging as captioning.
 In <em>First Workshop on NLP for Music and Audio</em>, 2020.
[&nbsp;<a href="pubs_bib.html#CaiEtAl2020">bib</a>&nbsp;| 
<a href="https://drive.google.com/file/d/1q1KUrYaP-ajGO9_I93oM7qHhz8XeAhOy/view?usp=sharing">Poster</a>&nbsp;| 
<a href="https://drive.google.com/file/d/1kHxvQH0zwO9C4EJ0kiIPxfTdQukfr2O7/view?usp=sharing">http</a>&nbsp;| 
<a href="pubs_abs.html#CaiEtAl2020">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="WatanabeEtAl2020">6</a>]
</td>
<td class="bibtexitem">
Shinji Watanabe, Michael&nbsp;I Mandel, Jon Barker, and Emmanuel Vincent.
 CHiME-6 challenge: Tackling multispeaker speech recognition for
  unsegmented recordings, 2020.
[&nbsp;<a href="pubs_bib.html#WatanabeEtAl2020">bib</a>&nbsp;| 
<a href="http://arxiv.org/abs/2004.09249">arXiv</a>&nbsp;| 
<a href="pubs_abs.html#WatanabeEtAl2020">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="mandelEtAl2019b">7</a>]
</td>
<td class="bibtexitem">
Lauren Mandel, Michael&nbsp;I. Mandel, and Chris Streb.
 Soundscape ecology: How listening to the environment can shape design
  and planning.
 In <em>American Society for Landscape Architects Conference on
  Landscape Architecture</em>, San Diego, CA, 2019.
[&nbsp;<a href="pubs_bib.html#mandelEtAl2019b">bib</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="NiAndMandel2019b">8</a>]
</td>
<td class="bibtexitem">
Zhaoheng Ni and Michael&nbsp;I Mandel.
 Onssen: an open-source speech separation and enhancement library.
 pages 7269--7273, 2020.
[&nbsp;<a href="pubs_bib.html#NiAndMandel2019b">bib</a>&nbsp;| 
<a href="http://dx.doi.org/10.1109/ICASSP40776.2020.9054265">DOI</a>&nbsp;| 
<a href="http://arxiv.org/abs/1911.00982">arXiv</a>&nbsp;| 
<a href="pubs_abs.html#NiAndMandel2019b">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="GroverEtAl2018">9</a>]
</td>
<td class="bibtexitem">
Vikas Grover, Michael&nbsp;I Mandel, Valerie Shafer, Yusra Syed, and Austin Twine.
 Understanding acoustic cues non-native speakers use for identifying
  english /v/-/w/ using bubble noise method.
 In <em>ASHA Convention</em>, 2018.
[&nbsp;<a href="pubs_bib.html#GroverEtAl2018">bib</a>&nbsp;| 
<a href="https://plan.core-apps.com/asha2018/event/b70a14de3fdfa5add67a659fc86a3ef5">http</a>&nbsp;| 
<a href="pubs_abs.html#GroverEtAl2018">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="GhalyEtA2017">10</a>]
</td>
<td class="bibtexitem">
Hussein Ghaly and Michael&nbsp;I Mandel.
 Analyzing human and machine performance in resolving ambiguous spoken
  sentences.
 In <em>1st Workshop on Speech-Centric Natural Language Processing
  (SCNLP)</em>, pages 18--26, 2017.
[&nbsp;<a href="pubs_bib.html#GhalyEtA2017">bib</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/scnlp17.pdf">.pdf</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="choi17">11</a>]
</td>
<td class="bibtexitem">
Jiyoung Choi and Michael&nbsp;I Mandel.
 Perception of korean fricatives and affricates in 'bubble' noise by
  native and nonnative speakers.
 In <em>International Circle of Korean Linguistics</em>, 2017.
[&nbsp;<a href="pubs_bib.html#choi17">bib</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="mandel15b">12</a>]
</td>
<td class="bibtexitem">
Michael&nbsp;I Mandel and Nicoleta Roman.
 Integrating markov random fields and model-based expectation
  maximization source separation and localization.
 In <em>Acoustical Society of America Spring Meeting</em>, 2015.
[&nbsp;<a href="pubs_bib.html#mandel15b">bib</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/asa15slides.pdf">Slides</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="mandel15a">13</a>]
</td>
<td class="bibtexitem">
Michael&nbsp;I Mandel, Sarah&nbsp;E Yoho, and Eric&nbsp;W Healy.
 Listener consistency in identifying speech mixed with particular
  “bubble” noise instances.
 In <em>Acoustical Society of America Spring Meeting</em>, 2015.
[&nbsp;<a href="pubs_bib.html#mandel15a">bib</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/asa15poster.pdf">Poster</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="mandel14d">14</a>]
</td>
<td class="bibtexitem">
Michael&nbsp;I Mandel and Song&nbsp;Hui Chon.
 Using auditory bubbles to determine spectro-temporal cues of timbre.
 In <em>Cognitively Based Music Informatics Research (CogMIR)</em>, 2014.
[&nbsp;<a href="pubs_bib.html#mandel14d">bib</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/cogmir14slides.pdf">Slides</a>&nbsp;| 
<a href="pubs_abs.html#mandel14d">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="nandi13">15</a>]
</td>
<td class="bibtexitem">
Arnab Nandi and Michael&nbsp;I Mandel.
 The interactive join: Recognizing gestures for database queries.
 In <em>CHI Works-In-Progress</em>, 2013.
[&nbsp;<a href="pubs_bib.html#nandi13">bib</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/chiwip13poster.pdf">Poster</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/chiwip13.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#nandi13">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="mandel11a">16</a>]
</td>
<td class="bibtexitem">
Michael Mandel, Razvan Pascanu, Hugo Larochelle, and Yoshua Bengio.
 Autotagging music with conditional restricted boltzmann machines,
  March 2011.
[&nbsp;<a href="pubs_bib.html#mandel11a">bib</a>&nbsp;| 
<a href="http://arxiv.org/abs/1103.2832">arXiv</a>&nbsp;| 
<a href="http://arxiv.org/abs/1103.2832">http</a>&nbsp;| 
<a href="pubs_abs.html#mandel11a">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="mandel06a">17</a>]
</td>
<td class="bibtexitem">
Michael&nbsp;I. Mandel and Daniel P.&nbsp;W. Ellis.
 A probability model for interaural phase difference.
 In <em>ISCA Workshop on Statistical and Perceptual Audio
  Processing SAPA</em>, pages 1--6, 2006.
[&nbsp;<a href="pubs_bib.html#mandel06a">bib</a>&nbsp;| 
<a href="http://www.isca-speech.org/archive/sapa_2006/sap6_001.html">Demo</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/sapa06slides.pdf">Slides</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/sapa06.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#mandel06a">Abstract</a>&nbsp;]

</td>
</tr>


<tr valign="top">
<td align="right" class="bibtexnumber">
[<a name="sudderth04">18</a>]
</td>
<td class="bibtexitem">
Erik&nbsp;B. Sudderth, Michael&nbsp;I. Mandel, William&nbsp;T. Freeman, and Alan&nbsp;S. Willsky.
 Visual hand tracking using nonparametric belief propagation.
 In <em>Proceedings of the IEEE Conference on Computer Vision and
  Pattern Recognition Workshops</em>, pages 189--197, 2004.
[&nbsp;<a href="pubs_bib.html#sudderth04">bib</a>&nbsp;| 
<a href="http://dx.doi.org/10.1109/CVPR.2004.200">DOI</a>&nbsp;| 
<a href="http://ssg.mit.edu/nbp/">Demo</a>&nbsp;| 
<a href="http://m.mr-pc.org/work/gmbv04.pdf">.pdf</a>&nbsp;| 
<a href="pubs_abs.html#sudderth04">Abstract</a>&nbsp;]

</td>
</tr>
</table>