Topic modeling for conference analytics
Refereed conference paper presented and published in conference proceedings



Other information
Abstract: This work presents our attempt to understand the research topics that characterize the papers submitted to a conference, using topic modeling and data visualization techniques. We infer the latent topics from the abstracts of all the papers submitted to Interspeech 2014 by means of Latent Dirichlet Allocation. The per-topic word distributions thus obtained are visualized through word clouds. We also compare the automatically inferred topics against the expert-defined topics (known as tracks for Interspeech 2014). The comparison is based on an information retrieval framework, where we use each latent topic as a query and each track as a document. For each latent topic, we retrieve a ranked list of tracks scored by the degree of word overlap. Each latent topic is then associated with its top-scoring track. This analytic procedure was applied to all submissions to Interspeech 2014 and sheds light on the conference by providing an overview of its topic categorization, popular versus unpopular topics, emerging topics and topic compositions. Such insights are potentially valuable for understanding the technical content of a field and planning the future development of its conference(s).
Authors: Liu P., Jameel S., Lam W., Ma B., Meng H.
Conference name: 16th Annual Conference of the International Speech Communication Association, INTERSPEECH 2015
Conference start date: 06.09.2015
Conference end date: 10.09.2015
Conference location: Dresden
Conference country/region: Germany
Year of publication: 2015
Month: 1
Day: 1
Volume: 2015-January
Pages: 707 - 711
ISSN: 1990-9772
Language: British English
Keywords: Conference analytics, Information retrieval, Topic modeling
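
The topic-to-track matching step described in the abstract (each latent topic used as a query, each track as a document, tracks ranked by word overlap) might be sketched as follows. This is a minimal illustration and not the authors' implementation: the function names, the use of a raw count of shared words as the overlap score, and the topic words and track descriptions are all hypothetical placeholders. The Latent Dirichlet Allocation step itself is not shown; the sketch assumes the top words of each topic are already available.

```python
# Illustrative sketch only. The topic words and track descriptions are
# made-up placeholders, not data from the paper, and a raw count of shared
# words is an assumed stand-in for the paper's "degree of word overlap".

def overlap_score(topic_words, track_text):
    """Count how many distinct topic words also occur in the track text."""
    track_vocab = set(track_text.lower().split())
    return len(set(topic_words) & track_vocab)


def rank_tracks(topic_words, tracks):
    """Return (track_name, score) pairs, highest overlap first.

    Ties are broken alphabetically by track name so the ranking is stable.
    """
    scored = [(name, overlap_score(topic_words, text))
              for name, text in tracks.items()]
    return sorted(scored, key=lambda pair: (-pair[1], pair[0]))


# Hypothetical tracks ("documents"), each described by a short text.
tracks = {
    "Speech Recognition": "acoustic model language model decoding recognition error rate",
    "Speech Synthesis": "synthesis voice prosody vocoder naturalness",
}

# Hypothetical latent topics ("queries"), each given by its top words.
latent_topics = {
    0: ["recognition", "acoustic", "model", "error", "neural"],
    1: ["synthesis", "prosody", "voice", "quality", "model"],
}

# Associate each latent topic with its top-scoring track.
assignment = {
    topic_id: rank_tracks(words, tracks)[0][0]
    for topic_id, words in latent_topics.items()
}

for topic_id, track_name in assignment.items():
    print(topic_id, "->", track_name)
```

A weighted score (for example, one that uses the per-topic word probabilities) would be a natural variation, but the abstract does not specify the exact scoring function, so the unweighted count above is only one possible reading.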

Last updated: 2020-01-09 at 00:05