The growth of molecular data from the fields of genomics, metagenomics and related 'omic' disciplines calls for ever improving methods of data collection, storage and analysis. The field of bioinformatics is rich in software and fast, economical computing environments are becoming essential components of almost all research labs pursing scientific questions using these data-rich technologies.
Intended users of NEBC Bio-Linux 5.0 range from students entering the field of bioinformatics and new users of Linux to institutional teaching labs and expert computational biology groups well versed in Linux looking to use the existence of freely available customised distributions to build and maintain computational infrastructure quickly and effectively.
Previously NEBC Bio-Linux was only easily accessible to NERC funded researchers through an application process. With the release of version 5.0, this system is available on-line for easy download. The simplified access means researchers worldwide can also benefit from the opportunities offered by Bio-Linux. Researchers in North America, Europe, New Zealand, India, Iran, Africa and China have already taken advantage of Bio-Linux and many more users are anticipated as the field of bioinformatics continues to grow rapidly.
Dawn Field, Director of NEBC, stated: "To apply information technology to the field of molecular biology researchers need access to multi-user, networked machines that are fast and contain a large suite of software. The NEBC Bio-Linux project has distributed the specialist skills and expertise needed to build this type of infrastructure within the United Kingdom. The result is a new generation of PhD students and postdocs in this community with more sophisticated computing skills. With the release of version 5.0 we aim to allow the rest of the world to take advantage of these developments."
NEBC's main funder, the United Kingdom Natural Environment Research Council is further supporting the implementation of Bio-Linux by funding a new NERC Environmental Bioinformatics Facility at the Centre for Ecology & Hydrology. This new facility, the fifth node of the NERC Molecular Genetics Facility (MGF), will become fully operational later in 2009 but from today it will be possible for researchers to cost Bio-Linux and associated bioinformatics support into NERC grant applications. The intention is that Bio-Linux will become the underpinning computational environment for all activities within the MGF-Oxford node.
Dawn Field added: "We need to foster a highly collaborative community that can make use of a network of computers throughout the United Kingdom. NEBC Bio-Linux provides a powerful framework for delivering support, minimizing duplication of effort and most importantly, empowering researchers to take on their own analyses using a large suite of tools."
The more visionary role of NEBC Bio-Linux is to build electronic networks of researchers with shared interests, using a shared platform. This is already happening with a recent application of Bio-Linux in Africa. Peter Dawyndt, Professor of Computing at the University of Ghent, found Bio-Linux on the web. He commented: "Bio-Linux is a terrific solution to our need to bring state-of-the-art bioinformatics computing platform to students in Africa. With just a set of DVDs in our luggage, we are able to install a top-notch computing environment in which to deliver our entire bioinformatics course. Most importantly, the entire infrastructure stays behind and remains available to interested students."
Bio-Linux is a derivative of Ubuntu Linux, customised for bioinformatics analysis and development work. Approximately 60 bioinformatics packages - providing around 500 individual programmes - are installed on Bio-Linux, including open-source packages developed at the NEBC. In addition, Bio-Linux comes with comprehensive, categorised documentation for the bioinformatics packages installed. Users can install a full NEBC Bio-Linux system or just add some or all packages to already installed Debian or Ubuntu Linux systems.
Lead Developer, Stewart Houten, stated: "Bio-Linux 5.0 retains the added-value features of Bio-Linux 4.0, but is now based on the highly popular and user-friendly Ubuntu distribution and the Gnome desktop. The system is available as an installable DVD or USB memory stick, making it readily accessible to a wide audience."
The open source GNU/Linux computing system is progressively being seized upon as the preferred choice in addressing researcher computing needs. Despite Linux distributions becoming easier to use, the task of configuring the system for a specific purpose and collecting, compiling and setting up the academic software remains challenging. Bio-Linux provides a solution to this challenge.
Stewart Houten added: "We make design choices in consultation with our community and continually adapt NEBC Bio-Linux to meet their needs. This release responds to the growing skills in our community in the use of Bio-Linux and the striking increase in the number of users downloading Bio-Linux or its packages from locations outside the United Kingdom."
NEBC Lead Bioinformatician, Bela Tiwari, stated: "The NEBC Bio-Linux network accelerates research through improved electronic communication and support. I can log into a remote machine when requested, allowing me to directly troubleshoot or undertake collaborative analysis. Likewise, researchers can make use of a range of mechanisms for securely sharing data. Having many users able to access a single well-maintained machine also makes effective use of NERC funds and research time alike."
Tony Travis of the NuGO consortium, stated: "The NEBC Bio-Linux package repository has been an essential element of the success of our NuGO Black box project, designed to equip our community of researchers with integrated bioinformatics solutions." Researcher Keith Jolley, of the University of Oxford, adopted Bio-Linux to deploy a specialist set of software for genetic tracking of pathogens in a clinical setting. He stated: "The availability of Bio-Linux has made it possible to distribute and maintain our software network with minimum effort."
NEBC Bio-Linux was conceived in 2002 as part of the data management plan of the NERC Environmental Genomics Science Programme. Dr. Pamela Kempton, of NERC stated: "The Bio-Linux project has fully delivered against our expectations for the project. We are pleased to see this new development and the potential for Bio-Linux to reach a wider user audience."
Dr. Jason Snape, based within AstraZeneca UK Ltd. and the Science Co-ordinator of the two NERC Environmental Genomics Programmes, stated: "NEBC and Bio-Linux were established at the outset of the NERC investment in genomics. This was a highly strategic investment aimed at building a community of environmental scientists at the forefront of genomics research that had access to the most sophisticated informatics infrastructure, technical support, advice and training that was available." Dr. Snape continued to say: "The global success of Bio-Linux and the efforts of the NEBC team in promoting high quality training and data management standards has delivered above and beyond the original vision of the Environmental Genomics Steering Committee. NEBC truly adds value to the NERC environmental genomics research community."
Researchers and developers alike are welcome to join the NEBC Bio-Linux project and more information about the project can be found on the NEBC Bio-Linux homepage.