<?xml version="1.0" encoding="UTF-8"?><xml><records><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Modee, Rohit</style></author><author><style face="normal" font="default" size="100%">Agarwal, Sheena</style></author><author><style face="normal" font="default" size="100%">Verma, Ashwini</style></author><author><style face="normal" font="default" size="100%">Joshi, Kavita</style></author><author><style face="normal" font="default" size="100%">Priyakumar, U. Deva</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">DART: deep learning enabled topological interaction model for energy prediction of metal clusters and its application in identifying unique low energy isomers</style></title><secondary-title><style face="normal" font="default" size="100%">Physical Chemistry Chemical Physics</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2021</style></year><pub-dates><date><style  face="normal" font="default" size="100%">OCT </style></date></pub-dates></dates><volume><style face="normal" font="default" size="100%">23</style></volume><pages><style face="normal" font="default" size="100%">21995-22003</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">Recently, machine learning (ML) has proven to yield fast and accurate predictions of chemical properties to accelerate the discovery of novel molecules and materials. The majority of the work is on organic molecules, and much more work needs to be done for inorganic molecules, especially clusters. In the present work, we introduce a simple topological atomic descriptor called TAD, which encodes chemical environment information of each atom in the cluster. TAD is a simple and interpretable descriptor where each value represents the atom count in three shells. We also introduce the DART deep learning enabled topological interaction model, which uses TAD as a feature vector to predict energies of metal clusters, in our case gallium clusters with sizes ranging from 31 to 70 atoms. The DART model is designed based on the principle that the energy is a function of atomic interactions and allows us to model these complex atomic interactions to predict the energy. We further introduce a new dataset called GNC_31-70, which comprises structures and DFT optimized energies of gallium clusters with sizes ranging from 31 to 70 atoms. We show how DART can be used to accelerate the process of identification of low energy structures without geometry optimization. Albeit using a topological descriptor, DART achieves a mean absolute error (MAE) of 3.59 kcal mol(-1) (0.15 eV) on the test set. We also show that our model can distinguish core and surface atoms in the Ga-70 cluster, which the model has never encountered earlier. Finally, we demonstrate the transferability of the DART model by predicting energies for about 6k unseen configurations picked up from molecular dynamics (MD) data for three cluster sizes (46, 57, and 60) within seconds. The DART model was able to reduce the load on DFT optimizations while identifying unique low energy structures from MD data.</style></abstract><issue><style face="normal" font="default" size="100%">38</style></issue><work-type><style face="normal" font="default" size="100%">Article</style></work-type><custom3><style face="normal" font="default" size="100%">Foreign</style></custom3><custom4><style face="normal" font="default" size="100%">3.676</style></custom4></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Modee, Rohit</style></author><author><style face="normal" font="default" size="100%">Verma, Ashwini</style></author><author><style face="normal" font="default" size="100%">Joshi, Kavita</style></author><author><style face="normal" font="default" size="100%">Priyakumar, U. Deva</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">MeGen-generation of gallium metal clusters using reinforcement learning</style></title><secondary-title><style face="normal" font="default" size="100%">Machine Learning-Science and Technology</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">gallium clusters</style></keyword><keyword><style  face="normal" font="default" size="100%">reinforcement learning</style></keyword><keyword><style  face="normal" font="default" size="100%">structure generation</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2023</style></year><pub-dates><date><style  face="normal" font="default" size="100%">JUN </style></date></pub-dates></dates><volume><style face="normal" font="default" size="100%">4</style></volume><pages><style face="normal" font="default" size="100%">025032</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;
	The generation of low-energy 3D structures of metal clusters depends on the efficiency of the search algorithm and the accuracy of inter-atomic interaction description. In this work, we formulate the search algorithm as a reinforcement learning (RL) problem. Concisely, we propose a novel actor-critic architecture that generates low-lying isomers of metal clusters at a fraction of computational cost than conventional methods. Our RL-based search algorithm uses a previously developed DART model as a reward function to describe the inter-atomic interactions to validate predicted structures. Using the DART model as a reward function incentivizes the RL model to generate low-energy structures and helps generate valid structures. We demonstrate the advantages of our approach over conventional methods for scanning local minima on potential energy surface. Our approach not only generates isomer of gallium clusters at a minimal computational cost but also predicts isomer families that were not discovered through previous density-functional theory (DFT)-based approaches.&lt;/p&gt;
</style></abstract><issue><style face="normal" font="default" size="100%">2</style></issue><work-type><style face="normal" font="default" size="100%">Article</style></work-type><custom3><style face="normal" font="default" size="100%">&lt;p&gt;
	foreign&lt;/p&gt;
</style></custom3><custom4><style face="normal" font="default" size="100%">&lt;p&gt;
	6.8&lt;/p&gt;
</style></custom4></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Wilson, Nikhil</style></author><author><style face="normal" font="default" size="100%">Verma, Ashwini</style></author><author><style face="normal" font="default" size="100%">Maharana, Piyush Ranjan</style></author><author><style face="normal" font="default" size="100%">Sahoo, Ameeya Bhusan</style></author><author><style face="normal" font="default" size="100%">Joshi, Kavita</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">HyStor: an experimental database of hydrogen storage properties for various metal alloy classes</style></title><secondary-title><style face="normal" font="default" size="100%">International Journal of Hydrogen Energy</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">Databases</style></keyword><keyword><style  face="normal" font="default" size="100%">machine learning</style></keyword><keyword><style  face="normal" font="default" size="100%">Metal hydrides</style></keyword><keyword><style  face="normal" font="default" size="100%">Solid-state hydrogen storage</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2024</style></year><pub-dates><date><style  face="normal" font="default" size="100%">NOV </style></date></pub-dates></dates><volume><style face="normal" font="default" size="100%">90</style></volume><pages><style face="normal" font="default" size="100%">460-469</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;
	In this work, we introduce the HyStor database, consisting of 1282 metal alloys along with their maximum hydrogen storage capacity (H2wt%) at a given absorption temperature. The curated HydPark database consist of 831 entries. We sourced compositions from research articles and various patent documents, resulting in addition of 451 compositions to the HydPark database. The addition is reflected in the data across all existing classes of alloys. Further, low entropy alloys (LEA), medium entropy alloys (MEA) and high entropy alloys (HEA) have been newly included classes. This has broadened the scope of the database to encompass the latest materials of interest for hydrogen storage. HyStor contains representation of 54 elements, with a temperature range of 200-800 K, and H2wt% ranging from 0.1 to 7.19. We conducted thorough checks for duplicate entries, erroneous data, and conflicting compositions within the database to ensure data quality. Furthermore, we conducted multiple tests to identify potential outlier compositions. The data curation and updation reflects into slight improved error metrics of the HYST model, reducing the Mean Absolute Error (MAE) from 0.31 to 0.29 and increasing the R2 score from 0.77 to 0.79.&lt;/p&gt;
</style></abstract><work-type><style face="normal" font="default" size="100%">Article</style></work-type><custom3><style face="normal" font="default" size="100%">&lt;p&gt;
	Foreign&lt;/p&gt;
</style></custom3><custom4><style face="normal" font="default" size="100%">&lt;p&gt;
	7.2&lt;/p&gt;
</style></custom4></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Verma, Ashwini</style></author><author><style face="normal" font="default" size="100%">Wilson, Nikhil</style></author><author><style face="normal" font="default" size="100%">Joshi, Kavita</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Solid state hydrogen storage: Decoding the path through machine learning</style></title><secondary-title><style face="normal" font="default" size="100%">International Journal of Hydrogen Energy </style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">Enthalpy of hydride formation</style></keyword><keyword><style  face="normal" font="default" size="100%">Hydrogen storage capacity</style></keyword><keyword><style  face="normal" font="default" size="100%">Metal hydrides</style></keyword><keyword><style  face="normal" font="default" size="100%">Predictive machine learning models</style></keyword><keyword><style  face="normal" font="default" size="100%">Solid-state hydrogen storage</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2024</style></year><pub-dates><date><style  face="normal" font="default" size="100%">JAN</style></date></pub-dates></dates><volume><style face="normal" font="default" size="100%">50</style></volume><pages><style face="normal" font="default" size="100%">1518-1528</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;We present a machine learning (ML) framework HEART (HydrogEn storAge propeRty predicTor) for identifying suitable families of metal alloys for hydrogen storage under ambient conditions. Our framework includes two ML models that predict the hydrogen storage capacity (HYST) and the enthalpy of hydride formation (THOR) of multi-component metal alloys. We demonstrate that a chemically diverse set of features effectively describes the hydrogen storage properties of the alloys. In HYST, we use absorption temperature as a feature which improved H2wt% prediction significantly. For out-of-the-bag samples, HYST predicted H2wt% with R2 score of 0.81 and mean absolute error (MAE) of 0.45 wt% whereas R2 score is 0.89 and MAE is 4.53 kJ/molH2 for THOR. These models are further employed to predict H2wt% and Delta H for similar to 6.4 million multi-component metal alloys. We have identified 6480 compositions with superior storage properties (H2wt% &amp;gt; 2.5 at room temperature and Delta H &amp;lt; 60 kJ/molH2). We have also discussed in detail the interesting trends picked up by these models like temperature dependent variation in the rate of hydrogenation and alloying effect on H2wt% and Delta H in different families of alloys. Importantly certain elements like Al, Si, Sc, Cr, and Mn when mixed in small fractions with hydriding elements like Mg, Ti, V etc. systematically reduce Delta H without significantly compromising the storage capacity. Further upon increasing the number of elements in the alloy i.e from binary to ternary to quaternary, the number of compositions with lower enthalpies also increases. From the 6.4 million compositions, we have reported new alloy families having potential for hydrogen storage at room temperature. Finally, we demonstrate that HEART has the potential to scan vast chemical spaces by narrowing down potential materials for hydrogen storage.&lt;/p&gt;
</style></abstract><work-type><style face="normal" font="default" size="100%">Article</style></work-type><custom3><style face="normal" font="default" size="100%">&lt;p&gt;Foreign&lt;/p&gt;
</style></custom3><custom4><style face="normal" font="default" size="100%">&lt;p&gt;7.2&lt;/p&gt;
</style></custom4></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Verma, Ashwini</style></author><author><style face="normal" font="default" size="100%">Joshi, Kavita</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">MH-PCTpro: a machine learning model for rapid prediction of pressure-composition-temperature (PCT) isotherms</style></title><secondary-title><style face="normal" font="default" size="100%">Iscience</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2025</style></year><pub-dates><date><style  face="normal" font="default" size="100%">APR </style></date></pub-dates></dates><volume><style face="normal" font="default" size="100%">28</style></volume><pages><style face="normal" font="default" size="100%">112251</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;
	We present a machine-learning powered Metal Hydride's Pressure-Composition-Temperature isotherm Predictor (MH-PCTpro) for metal compositions. To train the MH-PCTpro, an experimental database of PCT isotherms is built from published literature. The database comprises over 14,000 data points extracted from 237 PCT isotherms representing 138 distinct compositions. The dataset encompasses more than 25 elements and spans a broad spectrum of absorption temperatures (263-653 K) and hydrogen pressures (0.001-40 MPa). The model is validated on a wide range of alloy families and its predictions are consistent with experimental results. The model also captures temperature-dependent variations in plateau pressure, enabling determination of enthalpy and entropy of hydride formation through Van't Hoff plots. Hence, MH-PCTpro can be used as an ML tool for guiding PCT experiments, offering PCT isotherm predictions and valuable thermodynamic insights into materials suitable for solid-state hydrogen storage.&lt;/p&gt;
</style></abstract><issue><style face="normal" font="default" size="100%">4</style></issue><work-type><style face="normal" font="default" size="100%">Article</style></work-type><custom3><style face="normal" font="default" size="100%">&lt;p&gt;
	Foreign&lt;/p&gt;
</style></custom3><custom4><style face="normal" font="default" size="100%">&lt;p&gt;
	NA&lt;/p&gt;
</style></custom4></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Maharana, Piyush Ranjan</style></author><author><style face="normal" font="default" size="100%">Verma, Ashwini</style></author><author><style face="normal" font="default" size="100%">Joshi, Kavita</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Retrieval augmented generation for building datasets from scientific literature</style></title><secondary-title><style face="normal" font="default" size="100%">Journal of Physics-Materials</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">dataset building</style></keyword><keyword><style  face="normal" font="default" size="100%">Hydrogen storage</style></keyword><keyword><style  face="normal" font="default" size="100%">LLM</style></keyword><keyword><style  face="normal" font="default" size="100%">materials</style></keyword><keyword><style  face="normal" font="default" size="100%">RAG</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2025</style></year><pub-dates><date><style  face="normal" font="default" size="100%">JUL</style></date></pub-dates></dates><volume><style face="normal" font="default" size="100%">8</style></volume><pages><style face="normal" font="default" size="100%">035006</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;
	In this work, we show that employing retrieval augmented generation (RAG) with a large language model (LLM) enables us to extract accurate data from scientific literature and construct datasets. The rapid growth in publications necessitates the automation of extraction of structured data as it is crucial for training machine learning(ML) models. The pipeline developed is simple and can be adjusted accordingly with natural language as input. Quantization enables us to run LLMs on consumer hardware and remove the reliance on closed-source models. Both Llama3-8B and Gemma2-9B with RAG give structured output consistently and with high accuracy as compared to direct prompting. Using the newly developed protocol, we created a data set of metal hydrides for solid-state hydrogen storage from paper abstracts. The accuracy of the generated dataset was &amp;gt;88% in the cases tested. Further, we demonstrate that the generated dataset is ready-to-use for ML models by testing it with HYST to predict the H(2)wt\textbackslash% at a given temperature. Thus, we demonstrate a pipeline to create datasets from scientific literature at minimal computational cost and high accuracy.&lt;/p&gt;
</style></abstract><issue><style face="normal" font="default" size="100%">3</style></issue><work-type><style face="normal" font="default" size="100%">Article</style></work-type><custom3><style face="normal" font="default" size="100%">&lt;p&gt;
	Foreign&lt;/p&gt;
</style></custom3><custom4><style face="normal" font="default" size="100%">&lt;p&gt;
	4.3&lt;/p&gt;
</style></custom4></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Verma, Ashwini</style></author><author><style face="normal" font="default" size="100%">Joshi, Kavita</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">What drives property prediction for solid-state hydrogen storage? data or smart features?</style></title><secondary-title><style face="normal" font="default" size="100%">International Journal of Hydrogen Energy</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">Experimental data</style></keyword><keyword><style  face="normal" font="default" size="100%">Feature engineering</style></keyword><keyword><style  face="normal" font="default" size="100%">Hydrogen storage</style></keyword><keyword><style  face="normal" font="default" size="100%">machine learning</style></keyword><keyword><style  face="normal" font="default" size="100%">Metal hydrides</style></keyword><keyword><style  face="normal" font="default" size="100%">Property prediction</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2026</style></year><pub-dates><date><style  face="normal" font="default" size="100%">APR </style></date></pub-dates></dates><volume><style face="normal" font="default" size="100%">226</style></volume><pages><style face="normal" font="default" size="100%">154499</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;
	Metal hydrides play a pivotal role in a wide range of applications, including hydrogen storage, compression, heat management, and catalysis, making them a central focus of interdisciplinary research spanning chemistry, materials science, and engineering. The performance of the metal hydride-based systems is strongly governed by the thermodynamics of metal-hydrogen interactions. Among key thermodynamic properties, the equilibrium plateau pressure (P-eq) is particularly critical, as it defines operating conditions for hydrogen absorption and desorption. Traditionally, determining P-eq requires extensive experimental measurements, which limits the pace of materials discovery. On the other hand, predicting it through ML-based models is constrained by the availability of limited data. In this work, we demonstrate that smart features can be a way to overcome this limitation. EquiP, an ML model trained to predict ln(P-eq) as a function of temperature, generates Van't Hoff plots (P-eq vs. 1/T), enabling rapid determination of enthalpy and entropy of hydride formation. We demonstrate that incorporating structural descriptors derived from X-ray diffraction (XRD) data improves the performance of the model, particularly with sparse training datasets. A model trained using only compositional descriptors yields a validation mean absolute error (MAE) of 0.21 bar, whereas incorporating XRD features reduces the MAE substantially to 0.07 bar. This work demonstrates that with limited data, intelligent feature design grounded in domain knowledge is the key to improving predictions of complex material properties.&lt;/p&gt;
</style></abstract><work-type><style face="normal" font="default" size="100%">Article</style></work-type><custom3><style face="normal" font="default" size="100%">&lt;p&gt;
	Foreign&lt;/p&gt;
</style></custom3><custom4><style face="normal" font="default" size="100%">&lt;p&gt;
	8.3&lt;/p&gt;
</style></custom4></record></records></xml>