Journal article icon

Journal article

BIGSdb: Scalable analysis of bacterial genome variation at the population level

Abstract:

Background: The opportunities for bacterial population genomics that are being realised by the application of parallel nucleotide sequencing require novel bioinformatics platforms. These must be capable of the storage, retrieval, and analysis of linked phenotypic and genotypic information in an accessible, scalable and computationally efficient manner.

Results: The Bacterial Isolate Genome Sequence Database (BIGSDB) is a scalable, open source, web-accessible database system that meets these needs, enabling phenotype and sequence data, which can range from a single sequence read to whole genome data, to be efficiently linked for a limitless number of bacterial specimens. The system builds on the widely used mlstdbNet software, developed for the storage and distribution of multilocus sequence typing (MLST) data, and incorporates the capacity to define and identify any number of loci and genetic variants at those loci within the stored nucleotide sequences. These loci can be further organised into 'schemes' for isolate characterisation or for evolutionary or functional analyses. Isolates and loci can be indexed by multiple names and any number of alternative schemes can be accommodated, enabling cross-referencing of different studies and approaches. LIMS functionality of the software enables linkage to and organisation of laboratory samples. The data are easily linked to external databases and fine-grained authentication of access permits multiple users to participate in community annotation by setting up or contributing to different schemes within the database. Some of the applications of BIGSDB are illustrated with the genera Neisseria and Streptococcus.

The BIGSDB source code and documentation are available at:
http://pubmlst.org/software/database/bigsdb/.

Conclusions: Genomic data can be used to characterise bacterial isolates in many different ways but it can also be efficiently exploited for evolutionary or functional studies. BIGSDB represents a freely available resource that will assist the broader community in the elucidation of the structure and function of bacteria by means of a population genomics approach.

Publication status:
Published
Peer review status:
Peer reviewed

Actions


Access Document


Files:
Publisher copy:
10.1186/1471-2105-11-595

Authors


More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Zoology
Role:
Author
More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Zoology
Role:
Author


Publisher:
BioMed Central
Journal:
BMC Bioinformatics More from this journal
Volume:
11
Article number:
595
Publication date:
2010-12-10
Acceptance date:
2010-12-10
DOI:
EISSN:
1471-2105


Language:
English
Keywords:
UUID:
uuid:a4f29613-19f1-4963-b048-c592569bfda2
Local pid:
pubs:210263
Source identifiers:
210263
Deposit date:
2013-11-16

Terms of use



Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP