TUHH Open Research
Help
  • Log In
    New user? Click here to register.Have you forgotten your password?
  • English
  • Deutsch
  • Communities & Collections
  • Publications
  • Research Data
  • People
  • Institutions
  • Projects
  • Statistics
  1. Home
  2. TUHH
  3. Publication References
  4. Accurately predicting solubility curves via a thermodynamic cycle, machine learning, and solvent ensembles
 
Options

Accurately predicting solubility curves via a thermodynamic cycle, machine learning, and solvent ensembles

Publikationstyp
Journal Article
Date Issued
2025-11-21
Sprache
English
Author(s)
Al Ibrahim, Emad
Morgan, Nathan  
Müller, Simon  orcid-logo
Thermische Verfahrenstechnik V-8  
Motati, Saikiran  
Green, William  
TORE-URI
https://hdl.handle.net/11420/60267
Journal
Journal of the American Chemical Society  
Volume
147
Issue
49
Start Page
45057
End Page
45069
Citation
Journal of the American Chemical Society 147 (49): 45057-45069 (2025)
Publisher DOI
10.1021/jacs.5c13746
Scopus ID
2-s2.0-105024735412
Publisher
American Chemical Society (ACS)
Determining solubilities of organic molecules is critical in various fields such as pharmaceuticals, agrochemicals, and environmental science. Knowing how a solute will dissolve in different solvents and at different temperatures is essential for drug formulation, synthesis, purification, and crystallization. Hard-to-estimate solubility limits currently hinder the design of new processes, making innovation more expensive. We propose a fast and general method for predicting the solubilities of neutral organic molecules in a wide range of solvents and temperatures. Our method uses a thermodynamic fusion cycle to combine machine learning predictions of the activity coefficient, fusion enthalpy, and melting point temperature. This method was tested on a combined data set with more than 100,000 experimental solubility values, showing better or comparable performance to competing methods on many solubility benchmarks even at elevated temperatures. We also introduce reference ensembling to leverage all available experimental solubilities for a given solute in estimating its solubility in a different solvent. Reference ensembling is also shown to enhance the robustness of models trained directly on solubility data.
Subjects
Activity coefficient
Solubility
Solution chemistry
Solvents
Thermodynamic properties
DDC Class
540: Chemistry
TUHH
Weiterführende Links
  • Contact
  • Send Feedback
  • Cookie settings
  • Privacy policy
  • Impress
DSpace Software

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science
Design by effective webwork GmbH

  • Deutsche NationalbibliothekDeutsche Nationalbibliothek
  • ORCiD Member OrganizationORCiD Member Organization
  • DataCiteDataCite
  • Re3DataRe3Data
  • OpenDOAROpenDOAR
  • OpenAireOpenAire
  • BASE Bielefeld Academic Search EngineBASE Bielefeld Academic Search Engine
Feedback