TUHH Open Research
Help
  • Log In
    New user? Click here to register.Have you forgotten your password?
  • English
  • Deutsch
  • Communities & Collections
  • Publications
  • Research Data
  • People
  • Institutions
  • Projects
  • Statistics
  1. Home
  2. TUHH
  3. Publication References
  4. Adaptive Window Strategy for Topic Modeling in Document Streams
 
Options

Adaptive Window Strategy for Topic Modeling in Document Streams

Publikationstyp
Conference Paper
Date Issued
2018-07
Sprache
English
Author(s)
Murena, Pierre Alexandre  
Al-Ghossein, Marie  
Abdessalem, Talel  
CornuĂ©jols, Antoine  
TORE-URI
http://hdl.handle.net/11420/15253
Article Number
8489771
Citation
2018 International Joint Conference on Neural Networks (IJCNN 2018)
Contribution to Conference
2018 International Joint Conference on Neural Networks, IJCNN 2018  
Publisher DOI
10.1109/IJCNN.2018.8489771
Scopus ID
2-s2.0-85056490837
Extracting global themes from a written text has recently become a major issue for computational intelligence, in particular in Natural Language Processing communities. Among all proposed solutions, Latent Dirichlet Allocation (LDA) has gained a vast interest and several variants have been proposed to adapt to changing environments. With the emergence of data streams, for instance from social media, the domain faces a new challenge: Topic extraction in real time. In this paper, we propose a simple approach called Adaptive Window based Incremental LDA (AWILDA) originating from the cross-over between LDA and state-of-the-art methods in data stream mining. We train new topic models only when a drift is detected and select training data on the fly using ADWIN algorithm. We provide both theoretical guarantees for our method and experimental validation on artificial and real-world data.
TUHH
WeiterfĂĽhrende Links
  • Contact
  • Send Feedback
  • Cookie settings
  • Privacy policy
  • Impress
DSpace Software

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science
Design by effective webwork GmbH

  • Deutsche NationalbibliothekDeutsche Nationalbibliothek
  • ORCiD Member OrganizationORCiD Member Organization
  • DataCiteDataCite
  • Re3DataRe3Data
  • OpenDOAROpenDOAR
  • OpenAireOpenAire
  • BASE Bielefeld Academic Search EngineBASE Bielefeld Academic Search Engine
Feedback