Scientists now depend on databases to access the avalanche of information that they produce. For example, geneticists are trawling through the human genome for genes that are involved in diseases. Data providers put a huge amount of effort into providing data resources that are comprehensive, user-friendly and cross-linked to other databases; but different data providers use different methods.
This means that a researcher might have to search ten or more different databases to find all the information pertaining to a particular set of candidate genes. If they're doing these kinds of searches on a regular basis, they'll want their own local copies of the databases. Maintaining up-to-date and fully functioning versions of all those databases and the tools to search them is a huge and complex task.
Vincent Breton of CNRS in Clermont-Ferrand, France, a member of EMBRACE's Executive Board, describes the problem as analogous to the use of electrical items before the electrical grid. "You didn't know whether your gadget's plug would fit the socket", he stated.
EMBRACE will turn the relationship between user and provider on its head by enabling data providers to provide well-defined interfaces to their databases that will conform to the same standards, essentially creating a "data grid" - the EMBRACEgrid - that will allow users to make the most of dispersed data resources.
To ensure that EMBRACE's efforts are immediately useful to biologists, Europe's most heavily used biomolecular databases and tools will be integrated into the EMBRACEgrid. A "technology watch" will ensure that the EMBRACEgrid doesn't become locked into technology that is quickly superseded. The Grid will also receive regular workouts using test problems, such as identifying candidate genes for a disease or linking viral mutations to their ability to cause disease.
Disseminating information about the EMBRACEgrid will be vital to ensure that scientists throughout Europe not only use the new technology, but also help to expand the capabilities of the EMBRACEgrid by "Grid enabling" their own data resources.
"Many elegant and powerful computational biology tools are under-utilized", stated EMBRACE Executive Board Member Erik Bongcam-Rudloff from the University of Uppsala, Sweden. "EMBRACE will allow us to unlock their potential by standardizing access to them."