<?xml version="1.0" encoding="UTF-8"?><xml><records><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Karthikeyan, Muthukumarasamy</style></author><author><style face="normal" font="default" size="100%">Bender, Andreas</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Encoding and decoding graphical chemical structures as two-dimensional (PDF417) barcodes</style></title><secondary-title><style face="normal" font="default" size="100%">Journal of Chemical Information and Modeling</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2005</style></year><pub-dates><date><style  face="normal" font="default" size="100%">MAY</style></date></pub-dates></dates><number><style face="normal" font="default" size="100%">3</style></number><publisher><style face="normal" font="default" size="100%">AMER CHEMICAL SOC</style></publisher><pub-location><style face="normal" font="default" size="100%">1155 16TH ST, NW, WASHINGTON, DC 20036 USA</style></pub-location><volume><style face="normal" font="default" size="100%">45</style></volume><pages><style face="normal" font="default" size="100%">572-580</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;A wide range of molecular representations exist today, ranging from human-readable structural diagrams over line notations such as Wiswesser Line Notation (WLN) and SMILES to several dozen computer-readable file formats. Still, to encode molecular structures in a computer-readable way for inputting structures in computer systems those formats are not the method of choice since they are not easily and faultlessly readable via optical recognition. In the present study a two-dimensional (PDF417) barcode representation of molecular structures in SMILES format is explored that enables the user to read and input molecular structures into computer systems in a fully automated fashion. A Lempel-Ziv-Welch (LZW) based compressed version of SMILES is suggested for cases where the size of the structure exceeds the storage capacity of PDF417 barcodes. Alternatively, the compact ACS format may be employed as a structural representation. The input via barcodes is fast, practically error free due to the 2D barcodes used which employ error correction and fully automatic. A Web application interface is developed which is able to interpret these barcodes and export them as optimized 3D chemical structures. Applications of this representation range from keeping automated storage systems to Web-based tracking systems of molecular samples. The National Chemical Laboratory, Pune, employs 2D barcode encoded structures for in-house repository management, where barcodes can also be used for querying the database for similar or substructures of the query structure.&lt;/p&gt;</style></abstract><issue><style face="normal" font="default" size="100%">3</style></issue><work-type><style face="normal" font="default" size="100%">Article</style></work-type><custom3><style face="normal" font="default" size="100%">&lt;p&gt;Foreign&lt;/p&gt;</style></custom3><custom4><style face="normal" font="default" size="100%">&lt;p&gt;3.657&lt;/p&gt;</style></custom4></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Karthikeyan, Muthukumarasamy</style></author><author><style face="normal" font="default" size="100%">Glen, R. C.</style></author><author><style face="normal" font="default" size="100%">Bender, Andreas</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">General melting point prediction based on a diverse compound data set and artificial neural networks</style></title><secondary-title><style face="normal" font="default" size="100%">Journal of Chemical Information and Modeling</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2005</style></year><pub-dates><date><style  face="normal" font="default" size="100%">MAY</style></date></pub-dates></dates><number><style face="normal" font="default" size="100%">3</style></number><publisher><style face="normal" font="default" size="100%">AMER CHEMICAL SOC</style></publisher><pub-location><style face="normal" font="default" size="100%">1155 16TH ST, NW, WASHINGTON, DC 20036 USA</style></pub-location><volume><style face="normal" font="default" size="100%">45</style></volume><pages><style face="normal" font="default" size="100%">581-590</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;We report the development of a robust and general model for the prediction of melting points. It is based on a diverse data set of 4173 compounds and employs a large number of 2D and 3D descriptors to capture molecular physicochemical and other graph-based properties. Dimensionality reduction is performed by principal component analysis, while a fully connected feed-forward back-propagation artificial neural network is employed for model generation. The melting point is a fundamental physicochemical property of a molecule that is controlled by both single-molecule properties and intermolecular interactions due to packing in the solid state. Thus, it is difficult to predict, and previously only melting point models for clearly defined and smaller compound sets have been developed. Here we derive the first general model that covers a comparatively large and relevant part of organic chemical space. The final model is based on 2D descriptors, which are found to contain more relevant information than the 3D descriptors calculated. Internal random validation of the model achieves a correlation coefficient of R-2 = 0.661 with an average absolute error of 37.6 degrees C. The model is internally consistent with a correlation coefficient of the test set of Q(2) = 0.658 (average absolute error 38.2 degrees C) and a correlation coefficient of the internal validation set of Q(2) = 0.645 (average absolute error 39.8 degrees C). Additional validation was performed on an external drug data set consisting of 277 compounds. On this external data set a correlation coefficient of Q(2) = 0.662 (average absolute error 32.6 degrees C) was achieved, showing ability of the model to generalize. Compared to an earlier model for the prediction of melting points of druglike compounds our model exhibits slightly improved performance, despite the much larger chemical space covered. The remaining model error is due to molecular properties that are not captured using single-molecule based descriptors, namely both inter- and intramolecular interactions and crystal packing, for which examples of and reasons for outliers are given.&lt;/p&gt;</style></abstract><issue><style face="normal" font="default" size="100%">3</style></issue><work-type><style face="normal" font="default" size="100%">Article</style></work-type><custom3><style face="normal" font="default" size="100%">Foreign</style></custom3><custom4><style face="normal" font="default" size="100%">3.657</style></custom4></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Karthikeyan, Muthukumarasamy</style></author><author><style face="normal" font="default" size="100%">Krishnan, S.</style></author><author><style face="normal" font="default" size="100%">Pandey, A. K.</style></author><author><style face="normal" font="default" size="100%">Bender, Andreas</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Harvesting chemical information from the Internet using a distributed approach: chemxtreme</style></title><secondary-title><style face="normal" font="default" size="100%">Journal of Chemical Information and Modeling</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">calcination</style></keyword><keyword><style  face="normal" font="default" size="100%">clinker</style></keyword><keyword><style  face="normal" font="default" size="100%">one-dimensional model</style></keyword><keyword><style  face="normal" font="default" size="100%">rotary cement kiln</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2006</style></year><pub-dates><date><style  face="normal" font="default" size="100%">MAR</style></date></pub-dates></dates><number><style face="normal" font="default" size="100%">2</style></number><publisher><style face="normal" font="default" size="100%">ASC, CINF; CSA Trust; CSJ; GDCh; KNCV</style></publisher><pub-location><style face="normal" font="default" size="100%">1155 16TH ST, NW, WASHINGTON, DC 20036 USA</style></pub-location><volume><style face="normal" font="default" size="100%">46</style></volume><pages><style face="normal" font="default" size="100%">452-461</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;{The Internet is a comprehensive resource of chemical information which is at the same time largely unstructured. It provides a wealth of scientific information such as experimental data and requires a suitable automated data mining and analysis tool for its meaningful exploration. The Java based software presented here, ChemXtreme, is developed for harvesting chemical information from the Internet employing the Google API in combination with a distributed client/server text analysis architecture based on JavaRMI. It represents the first and until now the only toolkit for automated structured data retrieval from the Internet which is itself open source. ChemXtreme employs the ``search the search engine''&lt;/p&gt;</style></abstract><issue><style face="normal" font="default" size="100%">2</style></issue><work-type><style face="normal" font="default" size="100%">Article</style></work-type><notes><style face="normal" font="default" size="100%">7th International Conference on Chemical Structures, Noordwijkerhout, NETHERLANDS, JUN 05-09, 2005</style></notes><custom3><style face="normal" font="default" size="100%">&lt;p&gt;Foreign&lt;/p&gt;</style></custom3><custom4><style face="normal" font="default" size="100%">&lt;p&gt;3.657&lt;/p&gt;</style></custom4></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Karthikeyan, Muthukumarasamy</style></author><author><style face="normal" font="default" size="100%">Krishnan, S.</style></author><author><style face="normal" font="default" size="100%">Pandey, Anil Kumar</style></author><author><style face="normal" font="default" size="100%">Bender, Andreas</style></author><author><style face="normal" font="default" size="100%">Tropsha, Alexander</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Distributed chemical computing using chemstar: an open source java remote method invocation architecture applied to large scale molecular data from pubchem</style></title><secondary-title><style face="normal" font="default" size="100%">Journal of Chemical Information and Modeling</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2008</style></year><pub-dates><date><style  face="normal" font="default" size="100%">APR</style></date></pub-dates></dates><number><style face="normal" font="default" size="100%">4</style></number><publisher><style face="normal" font="default" size="100%">AMER CHEMICAL SOC</style></publisher><pub-location><style face="normal" font="default" size="100%">1155 16TH ST, NW, WASHINGTON, DC 20036 USA</style></pub-location><volume><style face="normal" font="default" size="100%">48</style></volume><pages><style face="normal" font="default" size="100%">691-703</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;present the application of a Java remote method invocation (RMI) based open source architecture to distributed chemical computing. This architecture was previously employed for distributed data harvesting of chemical information from the Internet via the Google application programming interface (API; ChemXtreme). Due to its open source character and its flexibility, the underlying server/client framework can be quickly adopted to virtually every computational task that can be parallelized. Here, we present the server/client communication framework as well as an application to distributed computing of chemical properties on a large scale (currently the size of PubChem; about 18 million compounds), using both the Marvin toolkit as well as the open source JOELib package. As an application, for this set of compounds, the agreement of log P and TPSA between the packages was compared. Outliers were found to be mostly non-druglike compounds and differences could usually be explained by differences in the underlying algorithms. ChemStar is the first open source distributed chemical computing environment built on Java RMI, which is also easily adaptable to user demands due to its ``plug-in architecture''. The complete source codes as well as calculated properties along with links to PubChem resources are available on the Internet via a graphical user interface at http://moltable.nel.res.in/chemstar/.&lt;/p&gt;</style></abstract><issue><style face="normal" font="default" size="100%">4</style></issue><work-type><style face="normal" font="default" size="100%">Article</style></work-type><custom3><style face="normal" font="default" size="100%">Foreign</style></custom3><custom4><style face="normal" font="default" size="100%">3.657</style></custom4></record></records></xml>