March 28, 2007

Virtual Screening & Docking - Comparative Methodology & Best Practice

Considerable uncertainty currently exists in the performance and comparison of different virtual screening and docking methods on different targets and problems.  There is a need for integrated high quality data to be made available for benchmarking. Furthermore numerous practice and methodological weaknesses exist in current screening and docking comparison studies which require collaborative development of new practices and methods of comparison for objective evaluation of different screening and docking methods. 


To address these issues we are holding a forum on 15 October 2007:

Virtual Screening & Docking - Comparative Methodology & Best Practice
to take place at the Community of Practice Meeting, Autumn 2007
a joint InnovationWell and eCheminfo InterAction Meeting

Bryn Mawr College, Philadelphia

http://www.echeminfo.com/COMTY_conferences

Additionally, the workshop will be supported by pre- and post-event virtual communications.

This forum and workshop will have an agenda developed by workshop leaders to address ways forward for cooperation in developing a framework for comparative methodology and best practice approaches in screening and docking methods and including discussion of the following topics:

  • statistically significant relationships between docking scores and ligand affinity
  • practices and procedures for the operation of community-based screening and docking comparisons including tests and interpretation of results, in a way that everyone can agree is fair.
  • use of wiki-based approaches for practice development
  • peer review, data compilation, running of programs, judgement of results
  • workflow descriptions for comparisons
  • beyond conformational energetics in the rank ordering of diverse compounds in high throughput virtual screening
  • measurement and benchmarking
  • binding mode prediction, virtual screening for lead identification, rank-ordering by affinity for lead optimization
  • atom typing, ligand preparation (ionic forms, tautomers, ...), ligand conformer generation, protein preparation (protonation, residue orientation, ...), ligand placement (top-down, bottom-up, fragment based, group based, ...), energy calculation (force field type, grid type, algorithm, ...), constraint handling (global and local optimization strategy? process to escape local minima?), scoring (single-objective, multi-objective, consensus, ...)
  • separation of test set information from model development
  • validation datasets, results and applicability domains
  • objective comparisons of standardized test datasets
  • actions for data integration and knowledge sharing between initiatives
  • the role of semantic web approaches in uniting structured data from multiple resources
  • the role of natural language processing for processing unstructured information
  • extraction of data from the scientific literature
  • methods and procedures for secure testing of commercial data that could be acceptable to industry
  • frameworks for computational model testing and validation
  • impact of knowledge management approaches
  • collaboration and community support structures and environments

The agenda of the forum and workshop will be designed by a set of workshop leaders so as to maximise interaction, discussion, issue resolution, and action plans for cooperation.  Virtual communication and collaboration approaches will be used pre- and post-event to increase the benefit of the workshop activity.  In preparation for the workshop a form of peer review, as hosted on a wiki, will be applied to all stages of a proposed methodology for comparison studies.  As an outcome of the workshop a subsequent fair "competition", including the design of the experiment, collection of the data, running of programs, and the analysis of the results, will be defined for subsequent execution.

Workshop leaders will be invited who are active in the area of screening and docking method development and application and resource creation and will include government, industry and academic representatives.

Barry Hardy

eCheminfo Community of Practice Manager

March 02, 2006

Training Week - Latest Advances in Drug Discovery Design & Planning Methods

We are holding a hands-on 5 Day eCheminfo Advanced Training Week for chemists and informaticians working in drug discovery the week of 3-7 July 2006 at the Chemistry Research Laboratory, Oxford University, Oxford, UK.  The week will consist of interactive pragmatic workshops led by leading experts and industry practitioners.  We will work through in detail and discuss practical examples, methods and emerging techniques.

Topics to be covered include Virtual Screening & Docking, Structure-based Drug Design, Reaction Planning & Library Design, Latest advances in ADME & Predictive Toxicology, Data Analysis & Visualisation and Integration of Cheminformatics & Bioinformatics Tools & Data.

These workshops are aimed to provide a set of stimulating workshops using latest advanced modelling techniques of relevance to chemists, life scientists and modellers working in drug discovery.  Participants should return to their labs with new ideas, best practices and software experiences to maximise productivity in their own drug discovery research activities.

Workshop groups will study problems with hands-on examples using leading-edge software and discuss complex issues highlighted by examples and case studies presented by instructors.  Software packages and an IT classroom will be used by instructors and participants to work through the problems. Participants may propose problems and issues to the faculty ahead of the workshop week and can also bring their own laptops to test software on their own test problems.

Participants will have ample opportunity to discuss their perspectives and criticisms of the methods studied and should take-away key nuggets of understanding from these intensive sessions.

Program includes the following workshops:

In silico Library Enumeration of Synthetically Feasible Libraries
Workshop Instructors: Gyorgy Pirok, Chief Technology Officer, ChemAxon
and Daniel Butler, Scientific Executive, Inhibox

Reaction Modelling and Prediction of Reaction Thermodynamics & Kinetics
Workshop Instructor: David Gallagher CAChe, BioSciences Group, Fujitsu

Applications of Filtering and Similarity in Virtual Screening
Workshop Instructor: Paul Hawkins, OpenEye Scientific Software

User Perspective Evaluation of Virtual Screening Methods

Workshop Instructor: David Lloyd, Hitachi Professor, Trinity College Dublin

Advances in Virtual Screening and Structure-based Drug Design
Workshop Instructors: Jas Gata-Aura and Gerd Rather, Schrodinger

Data Analysis & Visualisation in Discovery Chemistry & Biology
Workshop Instructor: Lennart Eriksson, Umetrics

Predictive Toxicology
Workshop Instructors: Mark Cronin, Liverpool John Moores University and Scott McDonald, Lhasa Limited

Discovery Data Mining using Data Pipelining
Workshop Instructor: Rob Brown, Scitegic

More details on the workshops can be found on the eCheminfo website at http://echeminfo.colayer.net/COMTY_training

Details on Accomodation and Registration follow here...

Continue reading "Training Week - Latest Advances in Drug Discovery Design & Planning Methods" »

November 03, 2005

Program/Schedule for European eCheminfo InterAction meeting (9,10 November 2005)

The European eCheminfo InterAction meeting "Applications of Cheminformatics and Chemical Modelling to Drug Discovery" takes place in Basel, Switzerland next week (November 9-10).  The schedule with list of presentations and discussion sessions is available at:
http://echeminfo.colayer.net/files/SpeakerTopcSchedule-IA05-Basel-v5.pdf

The eCheminfo meeting runs concurrently with our InnovationWell Community meeting; registrants are welcome to attend sessions from either program.  Further details, such as abstracts and speaker bios, can be viewed in the Program areas of either the http://innovationwell.net/ or the echeminfo.com websites.

We will be closing registrations shortly, so please let us know quickly if you'd like to come.  Register through the websites or contact Nicki Douglas (nicki.douglas at douglasconnect.com, +41 61 851 04 61)

And if you can't make the meeting, you can access the recordings and presentation materials if you become a member of one of these communities. You will also have access to all previous sessions (including those from last month's US InterAction meetings in Philadelphia being released this month).

Best regards,
Barry Hardy
Community of Practice Manager
Douglas Connect, Switzerland
tel: +41 61 851 0170

September 23, 2005

eCheminfo and InnovationWell Autumn 2005 Program

I summarise below the list of sessions with over 100 top speakers and discussion leaders for the program listing for the upcoming Autumn InterAction Meetings taking place in Philadelphia and Basel. Please add a traceback or link for other like-minded folk to find!

I also provide an electronic brochure for download here for the InnovationWell Autumn Program on Knowledge-based Innovation in Life Science Product Development:
http://barryhardy.blogs.com/theferryman/files/InnovationWell-ProgramAutumn05.PDF

And the equivalent eCheminfo brochure on Drug Discovery:
http://barryhardy.blogs.com/theferryman/files/eCheminfo-ProgramAutumn05.pdf

Note: The poster sessions will be run as electronic poster sessions using tabletop spaces, a wireless network and Internet facilities at the meetings, in addition to virtual access through the website, i.e., the posters will be electronic but the access can be face-to-face or virtual. You can participate in person and virtually in the poster sessions.  We can supply nourishment and refreshments locally; remote participants may have to order out! [We also expect, subject to on-site testing, to have live conference call capabilities for remote participating members to join local discussions.]  Anyone interested in presenting such an "electronic poster" should directly contact us via email at innovationwell at douglasconnect.com

Look forward to seeing you in Philadelphia or Basel!

Barry Hardy
Community of Practice Manager
Douglas Connect
http://douglasconnect.com/
+41 61 851 0170 (office)

InnovationWell & eCheminfo InterAction Meetings
Philadelphia, US, 11-12 October 2005 and Basel, Switzerland, 9-10 November
List of Sessions with Speakers & Schedule (InterAction Autumn Meetings)
http://innovationwell.net/ and http://echeminfo.com/

Registration to attend the meetings or to access virtually is available through the websites or through contacting Nicki Douglas [nicki.douglas at douglasconnect.com]

…PROGRAM LISTING CONTINUING IN FULL POSTING…..

Continue reading "eCheminfo and InnovationWell Autumn 2005 Program" »

August 11, 2005

eCheminfo Autumn 2005 Program Brochure - Cheminformatics & Modeling Applications in Drug Discovery

Here is a copy of the Program Brochure for our eCheminfo Autumn 2005 Program on Cheminformatics & Modeling Applications in Drug Discovery including our US InterAction Meeting in Philadelphia 11,12 October 2005 and our European meeting in Basel 9,10 November 2005:

Download eCheminfo-ProgramAutumn05.pdf

Barry Hardy
eCheminfo Community of Practice Manager
Douglas Connect, Switzerland
+41 61 851 0170 (office)

August 03, 2005

eCheminfo LinkedIn Networking Group

One of our primary goals with the eCheminfo Community of Practice is to provide an opportunity for focused professional peer-to-peer networking for those with interests in drug discovery informatics,  modelling and design.  It is hard to beat face-to-face meetings for networking and for that we hope you can make our InterAction Meetings in Philadelphia 11-12 October ’05 and in Basel 9-10 November ’05 (Program information on the meetings is being updated in the Program area of http://eCheminfo.com/)

We have also been piloting use of the LinkedIn service as a new networking tool for the group.  This is a useful tool for discovering more about other peer professionals with like interests and making contacts.

To join the eCheminfo community and to take advantage of this networking service simply complete the following two steps:

1) Complete the eCheminfo SignUp available at http://eCheminfo.com/.

2) Use the link below to join, and hence be able to make direct contact (within our and LinkedIn's carefully considered privacy guidelines and controls) with group members on queries, ideas, opportunities and proposals together.

Whether you use LinkedIn already or are new to LinkedIn, please join the members-only eCheminfo Group here (It is only one step away) by clicking on the following link:

https://www.linkedin.com/e/gis/1173/77EB680070FF/
(or by pasting it into your Web browser)

Please note that all members are either pre- or post-approved by me.

Through the members-only eCheminfo Group you can:

- Leverage the power of the eCheminfo network to find and reach the new contacts you need
- Accelerate your career through referrals from eCheminfo members
- Know more than a name: view rich professional profiles from fellow eCheminfo members
- Let other eCheminfo members know what you have to offer to them and their contacts
- Limit your network searches to other eCheminfo members only, if and when you wish to do so

Access to special eCheminfo features on LinkedIn is currently free of cost or financial obligations, and is available to eCheminfo members only. Any changes to this policy affecting an individual's service will be first subject to my and the individual member's explicit approval.

Barry Hardy
eCheminfo Community of Practice Manager
Douglas Connect, Switzerland
+41 61 851 0170 (office)

July 30, 2005

Virtual Screening, Docking & Scoring Program

Specific binding interactions are central to many biological processes and pathways. Similarly, most drugs act by binding specifically to a site on a target protein, thereby modulating protein activity. The quest for new drugs relies on many approaches, including computer-based docking. This computational approach has the potential to have a profound positive impact on drug discovery, justifying the keen interest in this field by pharmaceutical researchers. Over the past fifteen years, subsequent to the initial description of the DOCK program and in parallel with the exponential increase in the number of available high-resolution protein structures, many docking programs of use as virtual screening tools in the drug discovery process have emerged. Understanding the similarities and differences of different methods as well as their capabilities and limitations is both important and increasingly challenging.

The main objective of our "Virtual Screening, Docking and Scoring" program is to foster discussion amongst researchers working on both development of screening and docking methods and the application of such methods to drug discovery. This interaction will lead to a better understanding of the current state-of-the-art, improved screening and docking tools in the future, and enhanced awareness of how to apply the current set of tools.

We are holding InterAction Meeting sessions at Bryn Mawr College, Philadelphia on 11 October ’05 and in Basel, Switzerland on 9 November ’05 on Virtual Screening, Docking and Scoring.  The purpose of the meetings is to bring together world-class researchers working in the computational screening and docking area, and have them present and discuss recent and new results in cutting-edge research areas relevant to drug discovery. The emphasis of the meeting activity is on communication and open discussion, which we hope will lead to new ideas and collaborations towards progress and innovation in the disease areas discussed.  A significant amount of time at the meeting will be devoted to question time and panel and group discussions towards this goal.

A description of the session and invited speaker presentations with abstracts follows:

eCheminfo InterAction Meeting Session, Philadelphia, 11 October 2005

Virtual Screening, Docking & Scoring
chaired by Max Cummings (Johnson & Johnson)
eCheminfo 2005 US InterAction Meeting, 11-12 October 2005, Philadelphia, USA
Program Updates loaded in the Program Areas on http://echeminfo.com/

Invited Speakers & Discussion Leaders:
Reducing false positives in virtual screens on kinase targets, Emanuele Perola (Vertex)
New Docking Methods for Pose Prediction and Enrichment, Mark McGann (Openeye)
Molecular Docking as a Virtual Screening Tool, Renee DesJarlais (Johnson & Johnson)
Exploiting protein-specific information in docking, Willem Nissink (Cambridge Crystallographic Data Centre)
You can't find what's not there: the importance and the pitfalls of multiple representations of molecules in dockable databases, John Irwin (UCSF)

eCheminfo InterAction Meeting Session, Basel, 9 November 2005
Virtual Screening, Docking & Scoring
chaired by Miklos Vargyas (ChemAxon)
eCheminfo 2005 European InterAction Meeting, 9-10 November 2005, Basel, Switzerland
Program Updates loaded in the Program Areas on http://echeminfo.com/

Invited Speakers & Discussion Leaders:
Pattern Recognition and Grid Computing in Drug Discovery, Graham Richards (University of Oxford)
The Issue of Protein Flexibility in Docking-Based Virtual Screening, Xavier Barril (Vernalis)
Discovery of cell-permeable nonpeptide inhibitors of beta-secretase by high-throughput docking and continuum electrostatic calculations, Amedeo Caflisch (University of Zurich)
Beyond ligand flexibility: improvement and validation of rDock for Structure-Based Drug Design, David Morley (Enspiral Discovery Ltd)
Structure-based identification of GPCR ligands by high-throughput docking, Didier Rognan (University of Strasbourg)

[COMPLETE ABSTRACTS FOLLOW IN REMAINDER OF POST]

Continue reading "Virtual Screening, Docking & Scoring Program" »

CFP: Posters, Contributed talks, Bursaries for eCheminfo Autumn meetings on Drug Discovery

Poster Abstracts for eCheminfo Autumn InterAction Meetings should be submitted as soon as possible and at the latest by: 15 August ’05 for US meeting posters; 31 August ’05 for European meeting posters.

Posters can be on any informatics or modeling topic (and can also be combined with experimental approaches) of relevance to Drug Discovery.

Poster Abstracts (ca. 300 words) should be sent to eCheminfo at douglasconnect.com

We will select a number of contributed talks to be presented at the meetings based on abstracts submitted.  We also have sponsorship requests under review to support travel bursaries for a selection of young academic investigators to attend the meeting based on submitted abstract.

Papers submitted related to poster presentations or talks will be refereed for consideration for publication in a special Drug Discovery issue of the Molecular Simulation journal.  Deadline for paper submission: 31 October ’05 for US meeting; 30 November ’05 for European meeting posters.

Subscribed members who cannot attend the meetings in Philadelphia or Basel in person will be able to access all meeting presentations with audio and to submit posters through the eCheminfo website. 

You must also have completed your annual membership subscription or meeting registration in time through the Online Ticket Office available after login to http://echeminfo.com/ so that access rights or a meeting place reservation are provided for you.   Alternatively contact Nicki Douglas (nicki.douglas at douglasconnect.com, +41 61 851 0461) for support.  Academic and group rates are available.

Barry Hardy
eCheminfo Community of Practice Manager
Douglas Connect, Switzerland
+41 61 851 0170 (office)

eCheminfo US Autumn 2005 InterAction Meeting:  Applications of Cheminformatics & Chemical Modelling to Drug Discovery
October 11-12, Bryn Mawr College, Philadelphia, USA
and
eCheminfo European Autumn 2005 InterAction Meeting:  Applications of Cheminformatics & Chemical Modelling to Drug Discovery
November 9-10, Swissotel L’entrée Conference Center, Basel, Switzerland

You can view meeting abstracts in the Program area of http://echeminfo.com/

July 23, 2005

Web-based Services in Drug Discovery

Pharmaceutical research is under challenge to improve the choice, quality and safety of lead candidates. There is a clear need for an open discussion and an awareness of the requirements for a much more complex knowledge management and knowledge transfer between academic, government and commercial interests. The semantic web has the potential to make significant contributions to the drug discovery of the future but is at this time at an early development stage and there are only a few public tools for the data mining and sharing of chemical information.

Just a few years ago, the only imaginable way of doing in silico drug design - or, indeed, any cheminformatics research - was to use in-house and commercial software and databases. New developments in Web services however are offering today’s researchers additional resources. Although cheminformatics admittedly lags far behind bioinformatics (where an enormous wealth of data and software is literally a click away), we are beginning to see some chemical resources in open access.

A goal for this eCheminfo program on "Web-based Services in Drug Design" is to present some of the possibilities of web-based tools and data and to lead into discussions on how can web services work for both the academic world and industry, while maintaining commercial, ip and security concerns? What potential impact could they have on discovery productivity? What are the best sustainable business models that can be applied to such services? How significant are the benefits of increased upstream and downstream knowledge flow due to services based on ontology frameworks? What are the key current hindrances to be overcome for the integration of web services into drug discovery in the chemical information area?

We are convening the following community of practice meeting sessions  to both present the latest research advances in this area and to discuss the above issues.  (As with all our meetings, if you cannot make the meetings you can access the seminars and discussions through the eCheminfo website by signing up for community membership.)

Applications of Web-based Services in Drug Discovery
eCheminfo InterAction Meeting Session, Philadelphia, 11 October 2005
chaired by Marc Nicklaus, (National Institutes of Health)
eCheminfo 2005 InterAction Meeting, 11-12 October 2005, Philadelphia, USA

Presenters & Discussion Leaders:
A Web-based Chemoinformatics System for Drug Discovery, Brett Tounge (Johnson & Johnson)
Web enabling technology for the design, enumeration, optimization and tracking of compound libraries, Brad Feuston (Merck)
ZINC web services - providing 3D structures of purchasable compounds for virtual screening to humans and machines, John Irwin (UCSF)
Pubchem, Steve Bryant (NCBI)
Search-and-query Information System for the Study and Discovery of Novel Agents in the Treatment of Cancer, David Covell (NCI)
Title TBA, Dmitrii Rassokhin (Johnson & Johnson)

Applications of Web-based Services in Drug Discovery

eCheminfo InterAction Meeting Session, Basel, 10 November 2005
chaired by Kim Henrick (European Bioinformatics Institute)
eCheminfo 2005 InterAction Meeting, 9-10 November 2005, Basel, Switzerland

Presenters & Discussion Leaders:

Investigating chemical trends in the context of ligand-protein complexes by using on-line data analysis directly on the web, Dimitris Dimitropoulos (European Bioinformatics Institute)
The Representation of Chemical Structures and its Application to Property Prediction, Johann Gasteiger (Universitaet Erlangen-Nuernberg)
Open Archives as a Route for the Capture, Dissemination and Access to Chemical Information, Simon Coles (University of Southampton)
Identification of biological units in protein crystals, Eugene Krissinel (European Bioinformatics Institute)
SWISS-MODEL Server and Repository: Web based resources for comparative protein structure modeling and their application in drug discovery, Torsten Schwede (University of Basel)

ABSTRACTS

PHILADELPHIA SESSION

A Web-based Chemoinformatics System for Drug Discovery

Brett A. Tounge, Johnson & Johnson Pharmaceutical Research and Development, L.L.C., Welsh and McKean Roads, PO Box 776, Spring House, PA 19477, USA

One of the key questions that must be addressed when implementing a chemoinformatics system is whether the tools will be designed for use by the expert user or by the “bench scientist”.  This decision can impact not only the style of tools that are rolled out, but is also a factor in terms of how these tools are delivered to the end users.  The system that we outline here was designed for use by the non-expert user.  As such, the tools that we discuss are in many cases simplified versions of some common algorithms used in chemoinformatics.  In addition, the focus is on how to distribute these tools using a web services interface, which greatly simplifies delivering new protocols to the end user.

Web enabling technology for the design, enumeration, optimization and tracking of compound libraries

Bradley P. Feuston(1)*,Subhas J. Chakravorty(1), John F. Conway(3),J. Christopher Culberson(1), Joseph Forbes(1), Bryan Kraker(2), Patricia A. Lennon(3), Craig Lindsley(5), Georgia B. McGaughey(1), Ralph Mosley(2), Robert P. Sheridan(2), Mario Valenciano(4) and Simon K. Kearsley(2)

(1) Molecular Systems Department, P.O. BOX 4, West Point, PA 19486
(2) Molecular Systems Department, P.O. BOX 2000, Rahway, NJ 07065
(3) Basic Research Systems, P.O. BOX 4, West Point, PA 19486
(4) Basic Research Systems, P.O. BOX 2000, Rahway, NJ 07065
(5) Medicinal Chemistry Department,  P.O. BOX 4, West Point, PA 19486

Motivated by the need to augment Merck’s in-house small molecule collection, web-based tools for designing, enumerating, optimizing and tracking compound libraries have been developed. The path leading to the current version of this virtual library tool kit (VLTK) is discussed in context of the (then) available commercial offerings and the constraints and requirements imposed by the end users. Though the effort was initiated to simplify the tasks of designing novel, drug-like and diverse compound libraries containing between 2K-10K unique entities, it has also evolved into a powerful tool for outsourcing syntheses as well as lead identification and optimization. The web tool includes components that select reagents, analyze synthons, identify backup reagents, enumerate libraries, calculate properties, optimize libraries and finally track the synthesized compounds through biological assays. In addition to accommodating project specific designs and virtual 3D library scanning, the application includes tools for parallel synthesis, laboratory automation and compound registration.
To whom correspondence should be addressed.

Search-and-query Information System for the Study and Discovery of Novel Agents in the Treatment of Cancer

David Covell, National Cancer Institute, Building 1052, Rm 236, Frederick, MD 21702-1201, USA

The National Cancer Institute has maintained a panel of immortalized tumor cell lines since 1990 for the purpose of screening chemical agents as candidates in the search for more effective cancer treatments. During these past 25 years nearly 80,000 small synthetic compounds and an equal number of natural product extracts have been assayed in the NCI’s tumor screen. In parallel with this screening effort, the NCI has maintained a follow-up protocol comprised of secondary testing of active compounds in hollow-fiber and xenograft models. Parallel measurements of gene expressions within the NCI’s tumor cell panel have complemented this data as well as the development of nearly a dozen molecular target assays screened against a panel of ~200,000 small molecules. The data generated in these screens has been the subject of efforts to devise informatic-based methods for data mining. The product of this effort is embodied within the publicly accessible web tool at the url ‘spheroid.ncifcrf.gov’. The utilities within this web site represent a search-and-query information system for the study and discovery of novel agents in the treatment of cancer. The functionalities within this web utility allow interactions with a wide range of public databases and the NCI’s screening data. Presentation and discussion of these utilities will be made in the context of recent drug discovery explorations.

PubChem

Steve Bryant, NCBI

PubChem is a new online information resource from NCBI. The system provides information on the biological properties and activities of chemical substances. Following the sequence-deposition model followed by GenBank, PubChem's content is derived from user depositions of chemical structure and
bioassay data, including data from NIH's Molecular Libraries initiative. The retrieval system supports searches based on chemical names and chemical structure, as well as searches based on bioassay descriptions and activity information. It furthermore provides links to depositor sites, for further information, as well as links to other NCBI resources such as the PubMed literature database and Entrez's protein 3D structure database.

ZINC web services - providing 3D structures of purchasable compounds for virtual screening to humans and machines

John Irwin, Pharmaceutical Chemistry, UCSF, 1700 4th St, Suite 501D, San Francisco CA 94143-2550 USA

Despite the successes of virtual screening, and its growing use, there remain many barriers to entry for non-specialists wishing to use this technology. We created the ZINC database, a free collection of commercially available compounds for virtual screening to lower one of these barriers. We made ZINC more flexible and adaptive by creating web services. Molecules matching specific criteria, including chemical structures, may be searched, often in less than a minute. Results of datasbase searches may be reviewed in a web browser, the 3D structure of molecules may be viewed in a Jmol applet, and structures may be downloaded individually or en masse in popular formats. Because there will always be molecules that are not in ZINC, we allow users to upload and process their own molecules using the same protocols we use to prepare the database. The ZINC webserver can also handle queries in machine readable form using a well defined ontology.

BASEL SESSION

Open Archives as a Route for the Capture, Dissemination and Access to Chemical Information

Simon J. Coles, School of Chemistry, University of Southampton, Highfield, Southampton, Hampshire, SO17 1BJ, UK

Modern advances in high throughput synthesis, scientific analytical instrumentation and data analysis and mining techniques are presenting increasingly big challenges for chemical information management and discovery. Consequently, the conventional process of peer review of journal articles as the primary route for the dissemination of scientific data is unable to keep apace with these high rates of generation and is hindering the passage of this data to the public domain. The architecture and philosophy of the Open Archive presents a solution to both the data management and publication problems.

     Recent work undertaken by the eBank-UK project (http://www.ukoln.ac.uk/projects/ebank-uk/) has been addressing the issue of dissemination of scientific data and uses the philosophy of the Open Archive Initiative (OAI) to solve this problem, whilst the R4L project (http://r4l.eprints.org) uses the same approach for laboratory data management. The UK National Crystallography Service (NCS) (http://www.ncs.chem.soton.ac.uk/) has developed an Open Archive infrastructure for crystal structure data (http://ecrystals.chem.soton.ac.uk) as an exemplar of this methodology.

    All the data generated during the course of the crystal structure determination experiment is seamlessly or automatically captured, time-stamped for priority assertion purposes and deposited in a laboratory management repository. A report generation tool is then employed to collate all experimental information in the laboratory repository, based on a particular compound. This report is utilised to prepare a journal article, based on the experimental data, and both write ups are subsequently deposited in an Institutional Repository. The Institutional Repository publicises its data content to the internet through Open Archive Initiative (OAI) protocols, which allows aggregator services to harvest pertinent metadata. The aggregator search and discovery tools then provide seamless and unhindered access to the scientific reports and their underlying data, thus maximising efficient sharing of experimental chemical information.

Investigating chemical trends in the context of ligand-protein complexes by using on-line data analysis directly on the web

Dimitris Dimitropoulos, EMBL Outstation - Hinxton, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, United Kingdom

A prebuild relational data-warehouse with a rich set of relationships for the PDB and the PDB ligands, is the perfect environment for evaluating interesting research questions that may potentially reveal links between ligand chemistry, and the protein environment characteristics. In the MSD search database several common functional groups have been identified for ligands and associated on the atom level to the occurrences of the bound molecules in PDB entries using a consistent nomenclature. These are used as a starting point in order to explore the chemistry relationships to binding site information, secondary structure, and protein classification. As an example we will demonstrate how the MSD-mine tool for on-line data analysis and mining over the web, can be used to examine potential preference of functional fragment distribution towards particular SCOP domains, and in a separate example the contribution of different fragment areas in the binding site activity. The MSD-mine tool and the example scenarios are accessible from http://www.ebi.ac.uk/msdmine.

Identification of biological units in protein crystals

Eugene Krissinel & Kim Henrick, European Bioinformatics Institute, Hinxton, Cambridge, CB10 1SD, UK

PDB entries of protein structures solved by means of X-ray diffraction on protein crystals represent asymmetric units (ASU) of the crystals. In most instances, ASU may be chosen in many different ways and they do not necessarily coincide with the biological units, or stable protein assemblies that perform certain physiological functions. It is reasonable to expect that protein assemblies merge, rather than transform, during the crystallisation, therefore protein crystals do carry rather valuable data on the composition and geometry of biological units. Given that nearly 80% of PDB entries are obtained by means of protein crystallography and that direct experimental identification of assembly structure is difficult, detection of biological units in protein crystals is of considerable practical interest.

We propose a new approach to the problem from general principles of chemical thermodynamics, which is different from previous attempts [1,2] based on scoring of individual protein interfaces in crystal. We perform an exhaustive graph-theoretical search of all assemblies that are possible in a given crystal, and leave only those that appear to be thermodynamically stable. The stability estimate is based on the consideration of protein affinity and entropy change upon dissociation. Applied to PDB entries with oligomeric states known from the literature, our method gives 89% of correct predictions, which is higher than previously reported [2].

The method is implemented as a publicly available web server (http://www.ebi.ac.uk/msd-srv/prot_int/cgi-bin/piserver), which provides the assembly-related data for all PDB entries of structures solved by X-ray diffraction. The server can take PDB and mmCIF-formatted coordinate files for upload and calculate protein assemblies in real time (a few minutes in most instances). Detail, on-residue level, data on protein interactions, solvation energies, surface areas, hydrogen bonds and salt bridges are provided on output. Probable dissociation patterns of stable assemblies are also calculated. The calculated assemblies and individual crystal contacts (interfaces) may be visualised using the Rasmol software. The server includes a search facility for the identification of structurally equivalent protein interfaces in the PDB archive.

[1] Henrick, K.; Thornton, J. Thrends Biochem. Sci., 1998, 23, 358.
[2] Ponstingl, H.; Kabir, T.; Thornton, J. J. Appl. Cryst. 2003, 36, 1116.

SWISS-MODEL Server and Repository: Web based resources for comparative protein structure modeling and their application in drug discovery

Jürgen Kopp, Lorenza Bordoli, Konstantin Arnold, Markus Meuwly, Vincent Zoete, Hólmfríður B. Þorsteinsdóttir, and Torsten Schwede, University of Basel (Switzerland) & Swiss Institute of Bioinformatics

One of the bottlenecks of structure-based drug design is the availability of experimentally determined protein structures. Today, the number of structurally characterized proteins is about two orders of magnitude smaller than the number of known protein sequences, i.e. no direct experimental structural information is available for the vast majority of protein sequences. Theoretical methods for protein structure prediction are aiming to bridge this structure knowledge gap. As shown during the biannual CASP experiments, homology modeling is the only computational approach, that can generate accurate three-dimensional models for a protein for successful application in structure based drug development. The SWISS-MODEL Server and Repository have been developed to provide instant web-based access to annotated models generated by automated homology modeling, bridging the gap between sequence and structure databases.

Validation of homology models for drug discovery applications is a crucial aspect, and one important question is how errors and inaccuracies of the homology models affect the molecular modeling of protein-ligand interactions. We used a MM-GBSA approach to compute the binding free energy of interaction of 16 HIV-1 protease inhibitor complexes in experimental and model structures. Using this system, we can introduce systematic errors in the protein model to simulate the typical inaccuracies that occur during homology modeling to quantify the effect on ligand binding affinity and ranking of inhibitors.

References:
Schwede T, Kopp J, Guex N, and Peitsch MC (2003) SWISS-MODEL: an automated protein homology-modeling server. Nucleic Acids Research 31, 3381-338.
Kopp J, and Schwede T (2004). The SWISS-MODEL Repository of annotated three-dimensional protein structure homology models. Nucleic Acids Research 32 , D230-D234.
Contact: Torsten Schwede

The Representation of Chemical Structures and its Application to Property Prediction

Johann Gasteiger, Computer-Chemie-Centrum, Universität Erlangen-Nürnberg
D-91052 Erlangen, Germany

The relationships between the structure of a chemical compound and many of its physical, chemical, or biological properties are too complex to be calculated from first principles. This is particularly true for the biological activity of a compound.  In this situation an indirect approach has to be taken.[1]  First, chemical descriptors have to be calculated for the molecular structure. Then, a relationship between these structure descriptors and the property to be predicted has to be established by inductive learning methods such as statistical or pattern recognition methods or neural networks. Methods for calculating structure descriptors available on the internet will be presented. Furthermore, program packages for data analysis and data mining that can be accessed on the web will be indicated. Of crucial importance for property modeling is the availability of data. Fortunately, increasingly, data become available on the internet. Some applications in modeling properties such as solubility or toxicity will be presented.

[1]  Chemoinformatics – A Textbook , J. Gasteiger, T. Engel (Editors), Wiley-VCH,
       Weinheim, 2003.

Barry Hardy
eCheminfo Community of Practice Manager
Douglas Connect, Switzerland

July 12, 2005

eCheminfo European Autumn 2005 InterAction Meeting

The eCheminfo European Autumn 2005 InterAction Meeting will be taking place at the Swissotel L’entrée Conference Center, Basel, Switzerland, Nov 9-10.  Program information and registration is available through the eCheminfo website at http://echeminfo.com/

The theme of the meeting is the application of cheminformatics and chemical modelling to drug discovery and will include the following sessions:

Virtual Screening, Docking & Scoring, chaired by Miklos Vargyas (ChemAxon); Web-based Services in Drug Design, chaired by Kim Henrick (European Bioinformatics Institute); Computational Biochemistry, chaired by Alessandro Curioni (IBM Zurich); Applications of Machine Learning & Graph Mining in Drug Discovery, chaired by Stefan Kramer (Technische Universitaet Muenchen); Biosensors & Nanofluidics, Nick Quirke (Imperial College London); Protein Folding and Dynamics, chaired by Wilfred van Gunsteren (ETH-Zurich)

Invited Speakers include:
Graham Richards (University of Oxford), Xavier Barril (Vernalis), Amedeo Caflisch (ETH-Zurich), David Morley (Enspiral Discovery Ltd), Didier Rognan (University of Strasbourg), Dimitris Dimitropoulos (European Bioinformatics Institute), Johann Gasteiger (Universitaet Erlangen-Nuernberg), Simon Coles (University of Southampton), Eugene Krissinel (European Bioinformatics Institute), Torsten Schwede (University of Basel), Paul Tavan (University of Munich), Ursula Roethlisberger (Swiss Federal Institute of Technology, Lausanne), Alessandro Curioni (IBM Zurich), Ian Williams (University of Bath), Gerald Monard (Universite Henri Poincare – Nancy), Ken Merz (QuantumBio), Nick Quirke (Imperial College London), Richard Gilbert (E2v Technologies), Andrew De Mello (Imperial College London), Jens Walther (ETH-Zurich), Christoph Helma (University of Freiburg), Peter Willett (University of Sheffield), Joost N. Kok (Leiden University), Gisbert Schneider (Johann Wolfgang Goethe-University), Michael Berthold (University of Konstanz), Jeremy Smith (University of Heidelberg), Wolfgang Wenzel (University of Karslruhe), Thomas Kiefhaber (University of Basel), Xavier Daura (University of Barcelona), Nikolay V. Dokholyan (University of North Carolina), Michele Vendruscolo (University of Cambridge)

The eCheminfo meetings will have exhibitor tables for software demos and evening poster sessions; posters will also be available for viewing through the website.  Posters can be on any informatics or modeling topic of relevance to drug discovery. Those who cannot make the meetings are also able to register to access all presentations, posters and discussions through the eCheminfo website. Registrants for the European meeting will also be able to access the full proceedings with audio of the equivalent US meeting being held in Philadelphia.

Poster Abstracts (ca. 300 words) should be sent to eCheminfo at douglasconnect.com

The InnovationWell European community of practice Autumn meeting will be taking place at the same location and on the same dates as the eCheminfo European meeting.  This meeting will be covering semantic web applications in drug discovery and development, life science intellectual property management and valuation; knowledge management and transfer in R&D and manufacturing, clinical trial productivity, drug safety knowledge management and predictive toxicology.  All eCheminfo registrants will also have access to the InnovationWell meeting activities. (See http://innovationwell.net/ for more details).

Please register early for the meetings as the number of places are limited.  You can register for the meeting on the website or contact Nicki Douglas (nicki.douglas at douglasconnect.com, +41 61 851 0461) to reserve your place.

Barry Hardy
Community of Practice Manager
Douglas Connect

Communities of Practice

eCheminfo Chairs, Presenters & Instructors