CINF 42 |
| We present new tools and services developed by the CADD Group, NCI, for searching for structures in very large databases, such as very large screening sample collections. One of these tools is a service for very rapid structure lookup, making use of InChIs as well as CACTVS hash code-based identifiers. These latter, designed to allow one take into account tautomerism, different resonance structures drawn for charged species, and presence of additional fragments, enable fine-tunable yet rapid compound identification and database overlap analyses. We also present a powerful substructure search tool, implemented in the form of a web service, for databases of millions of compounds, using a search engine operating in distributed mode across a Linux cluster. Finally, a tool for automatic generation of a web interface, for searches by substructure and other criteria, from a database file, e.g. an SDF, is presented. Some of these tools and services are being made publicly available on the CADD Group's web server. |
|
Challenges in Structure Searching
1:00 PM-5:00 PM, Monday, 11 September 2006 Moscone Center -- Room 122, Oral
Division of Chemical Information |