TUHH Open Research
Help
  • Log In
    New user? Click here to register.Have you forgotten your password?
  • English
  • Deutsch
  • Communities & Collections
  • Publications
  • Research Data
  • People
  • Institutions
  • Projects
  • Statistics
  1. Home
  2. TUHH
  3. Publication References
  4. Active learning of potential-energy surfaces of weakly bound complexes with regression-tree ensembles
 
Options

Active learning of potential-energy surfaces of weakly bound complexes with regression-tree ensembles

Publikationstyp
Journal Article
Date Issued
2021-10-14
Sprache
English
Author(s)
Saleh, Yahya
Sanjay, Vishnu  
Iske, Armin  
Yachmenev, Andrey  
Küpper, Jochen  
TORE-URI
https://hdl.handle.net/11420/47822
Journal
The Journal of Chemical Physics  
Volume
155
Issue
14
Article Number
144109
Citation
Journal of Chemical Physics 155 (14): 144109 (2021-10-14)
Publisher DOI
10.1063/5.0057051
Scopus ID
2-s2.0-85117135136
ISSN
00219606
Several pool-based active learning (AL) algorithms were employed to model potential-energy surfaces (PESs) with a minimum number of electronic structure calculations. Theoretical and empirical results suggest that superior strategies can be obtained by sampling molecular structures corresponding to large uncertainties in their predictions while at the same time not deviating much from the true distribution of the data. To model PESs in an AL framework, we propose to use a regression version of stochastic query by forest, a hybrid method that samples points corresponding to large uncertainties while avoiding collecting too many points from sparse regions of space. The algorithm is implemented with decision trees that come with relatively small computational costs. We empirically show that this algorithm requires around half the data to converge to the same accuracy in comparison to the uncertainty-based query-by-committee algorithm. Moreover, the algorithm is fully automatic and does not require any prior knowledge of the PES. Simulations on a 6D PES of pyrrole(H2O) show that <15 000 configurations are enough to build a PES with a generalization error of 16 cm−1, whereas the final model with around 50 000 configurations has a generalization error of 11 cm−1
Subjects
MLE@TUHH
DDC Class
005: Computer Programming, Programs, Data and Security
TUHH
Weiterführende Links
  • Contact
  • Send Feedback
  • Cookie settings
  • Privacy policy
  • Impress
DSpace Software

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science
Design by effective webwork GmbH

  • Deutsche NationalbibliothekDeutsche Nationalbibliothek
  • ORCiD Member OrganizationORCiD Member Organization
  • DataCiteDataCite
  • Re3DataRe3Data
  • OpenDOAROpenDOAR
  • OpenAireOpenAire
  • BASE Bielefeld Academic Search EngineBASE Bielefeld Academic Search Engine
Feedback