Biological Science Data Abstracts |
|||||
Medical
and Health Data
Behavioral and Social Science Data Data Policy Detailed ProgramList
of Participants About the CODATA 2002 Conference
|
1.
Building the US National Biological Information Infrastructure:
Synergy between Regional and National Initiatives In 1994, the U.S. President signed Executive Order 12906, "Coordinating Geographic Data Acquisition and Access: the National Spatial Data Infrastructure (NSDI)." The NSDI deals with the acquisition, processing, storage, and distribution of geospatial data, and is implemented by the Federal Geographic Data Committee (FGDC). At the same time, the national biotic resource information system became the NBII (web page - http://www.nbii.gov). The NBII is implemented through the auspices of the U.S. Geological Survey (USGS). The NBII works with the FGDC to increase access and dissemination of biological geospatial data through the NBII and the NSDI. The NBII biological metadata standard, is an approved "profile" or extension of the FGDC's geospatial metadata standard. In 1998, the Biodiversity and Ecosystems Panel of the President' s Committee of Advisors on Science And Technology (PCAST) released the report titled "Teaming With Life: Investing in Science to Understand and Use America's Living Capital". The PCAST report recommended that the federal government develop the "next generation NBII" or NBII-2. This would be accomplished through a system of nodes (interconnected entry points to the NBII). In 2001, the U.S. Congress allocated the funds for the development and promotion of the node based NBII-2. Development and implementation of the NBII nodes is underway and is being conducted in collaboration with every sector of society. There are three types of nodes. "Regional" nodes have a geographic area of responsibility and represent a regional approach to local data, environmental issues, and data collectors. Twelve (12) regional nodes are required to cover the entire U.S. "Thematic" nodes focus on a particular biological issue (i.e., bird conservation, fisheries and aquatic resources, invasive species, urban biodiversity, wildlife disease/human health, etc.). Such issues cross regional, national, and even international boundaries. "Infrastructure" nodes are focused on issues such as the creation, adoption, and implementation of standards through the development of common tool suites, hardware and software protocols, and geospatial technologies to achieve interoperability and transparent retrieval across the entire NBII network. This presentation will highlight NBII development, implementation,
lessons learned, and successful user applications of two regional
nodes, the Southern Appalachian Information Node (SAIN) and
the Central Southwest/Gulf Coast Node (CSGCN). Specific NBII
applications will include multiple country-, regional-, county-,
and local- (site specific) level biological, environmental,
and natural resource management issues.
2.
Building a Biodiversity Information Network in India
Biodiversity Informatics and Developing World: Status and Potentials The most of the striking feature of Earth is the existence of life, and the most striking feature of life is its diversity. Biodiversity, and the ecosystems that support it, contribute trillions of dollars to national and global economies. The basis of all efforts to effectively conserve biodiversity and natural ecosystems lies in efficient access to knowledgebase on biodiversity and ecosystems resources and processes. Most of the developed countries are well ahead in the race to take advantage of new electronic information opportunities to manage and build their biodiversity knowledge bases, the recognized cornerstone for their future economic, social and environmental well being. For developing nations, which harbors rich and diversified natural resources, much of the biodiversity information is neither available nor accessible. Hence there is a need for organized, well-resourced, national approach to build and manage biodiversity information through collaborative efforts by this group of Third World Nations. This paper reviews the state of information technology applications in the field of biodiversity informatics in these nations, with India as model nation. India is one of the 12 mea-biodiversity countries bestowed with rich floral and faunal diversity. With its deteriorating status of natural resources and developmental activities, India is one of the best model nation for such a review. Attempts made by the author's group to develop and implement cost-efficient, easy-to-use tools for biological data management are described in brief. Feasibility of employing available tools, techniques and standards for biological data acquisition, organization, analysis, modeling and forecasting has been discussed keeping in view the informatics awareness amongst the biologists and ecologists as well as planners. With specific reference to Indian biodiversity, authors suggest the framework to build national information infrastructure to correlate, analyze and communicate biological information to help these nations to generate sustainable wealth from nature.
3.
Developing and Integrating Data Resources from a North American
Perspective Biodiversity Information denotes a very heterogenous set of data formats, updating regimes, quality, and users. The data in the labels of biological specimens provide a natural organizing framework because the georeference and the taxonomic name can be used to link to geographically organized data (remote sensing, cartography) and to a variety of points of view (ecological or genetical data, legislation, traffic, etc.). Label data, however is widely distributed over hundreds of institutions. In this talk, we describe the technical and organizational problems that were solved to create REMIB (the World Network of Biodiversity Information), that links nearly 5 million specimens from 61 collections of 16 institutions in three countries. We also give one example of the use that such system may have.
Scientists within the Long-Term Ecological Research (LTER) Network have provided leadership in ecological informatics since the inception of LTER in 1980. The success of LTER, where research projects span wide temporal and spatial scales, depends on the quality and longevity of the data collected. Scientists have devised data collection, data entry, data access, QA/QC and archiving strategies for ensuring that high quality data are appropriately managed to meet the needs of a broad user base for decades to come. The LTER cross-site Network Information System (NIS) is being developed to foster data sharing and collaboration among sites. Recent and important milestones for LTER include adoption of Ecological Metadata Language as a standard as well as supporting metadata software. Current and future foci include developing data standardization protocols and semantic mediation engines, both of which will facilitate LTER modeling efforts.
5.
The Global Biodiversity Information Facility (GBIF) Challenges
and Opportunities from a Global Perspective The Global Biodiversity Information Facility (GBIF) is a new international scientific cooperative project based on an agreement between countries, economies, and international organizations. The primary goal of GBIF is to establish an interoperable, distributed network of databases containing scientific biodiversity information in order to make the world's scientific biodiversity data freely available to all. GBIF will play a crucial role in promoting the standardization, digitization and global dissemination of the world's scientific biodiversity data within an appropriate framework for property rights and due attribution. Initially, GBIF will focus on species and specimen level data in 4 priority areas: data access and data interoperability; digitization of natural history collection data, electronic catalogue of names of known organisms; outreach and capacity building. With an expected staff of only 14, GBIF will work mostly with others in order to catalyse synergistic activities between participants, generate new investments and eliminate barriers to cooperation. In its first year of activity, GBIF has been concentrating on organisational logistics, staffing, and consultations with Scientific and Technical Advisory Groups (STAGs). Initial work plans are being drafted by the Science committee and its 4 subcommittees. Once functional, GBIF will allow to unlock and liberate vast amounts of biodiversity occurrence data for use in research and environmental decision-making. Life itself, in all its diversity (from molecules, to species, to ecosystems) will provide numerous new additional sets of data layers for integrated environmental analysis, modelling and forecasting.
1.
A Proteomic Approach to the Study of Cancer During the past 20 years, high resolution two dimensional polyacrylamide gel electrophoresis (2D PAGE) has been the technique of choice for analysing the protein composition of cell types, tissues and fluids, as well as for studying changes in protein expression profiles elicited by various effectors. The technique, which was originally described by O'Farrell and Klose, separates proteins both in terms of their isoelectric point (pI) and molecular weight. Usually, one chooses a condition of interest and lets the cell reveal the global protein behavioral response as all detected proteins can be analyzed both qualitatively (post translational modifications) and quantitatively (relative abundance, corregulated proteins) in relation to each other [http://biobase.dk/cgi bin/celis]. Presently, high resolution 2D PAGE provides the highest resolution for protein analysis and is a key technique in proteomics, an emerging area of research of the post-genomic era that deals with the global analysis of gene expression using a plethora of technology to resolve (2D PAGE), identify (mass spectrometry, Western immunoblotting, etc.), quantitate and characterize proteins, identify interacting partners as well as to store (comprehensive 2D PAGE databases), communicate and interlink protein and DNA mapping and sequence information from ongoing genome projects. Proteomics, together with genomics, cDNA arrays, phage antibody libraries and transgenic models belong to the armamentarium of technology comprising functional genomics. Here I will report on our efforts to apply proteomic technologies to the study of bladder cancer.
2.
A Proposition of XML Format for Proteomics Database The rationale and advantages of using XML: The exchange of
proteome analysis data including sample preparation and experimental
conditions in detail is very useful and important in order to
enhance studies in proteomics. Standardized format for data
exchange will accelerate collaboration among proteomics researchers.
HUP-ML and HUP-ML Editor: We have proposed an XML format, HUP-ML(Human
Proteome Markup Language), for proteomics database and have
developed a HUP-ML Editor with which researchers can easily
make HUP-ML documents. It would accelerate collaboration among
proteomics researchers if a platform exchanging these data is
developed on the internet. By using our XML-based model for proteomics, we have also developed
web-based prototype system which consists of XML database, agent,
security and graphical user interface (GUI).
3.
Proteomics : An Important Post-genomic Tool for Understanding
Gene Function While the term proteomics is often synonymous with high-throughput
protein profiling of normal versus diseased tissue by 2-D gel
analysis, this definition is very limiting. Increasingly, the
power of proteomics is being recognized for its ability to unravel
intricate protein-protein interactions associated with intracellular
protein trafficking and signaling pathways (i.e., cell-mapping
proteomics). The technology issues associated with expression
proteomics (the study of global changes in protein expression)
and cell-mapping proteomics (the systematic study of protein-protein
interactions through the isolation of protein complexes) are
almost identical and only differ in front-end scale-up processes.
The application of proteomics for studying various biological
problems will be presented with representative examples of (a)
differential protein expression for identifying surrogate markers
for colon cancer progression, (b) a non-2D gel approach for
dissecting complex mixtures of membrane proteins, (c) proteins
that inhibit cytokine signal transduction, (d) proteins that
are involved in the intricate pathway that leads to programmed
cell death (apoptosis).
4.
Human Kidney Glomerulus Proteome and proposition of a method
for native protein profiling To elucidate molecular
mechanism of a chronic nephritis, the following proteome research
of kidney glomeruli has been initiated. Pieces of cortex of
kidney with normal appearance were obtained from patients underwent
surgical nephrectomy due to renal tumor. Glumeruli preparation
were carried out from the cortex by a standard sieving process
using four sieves. The glomeruli on the 150 µm sieve were
collected and further purified by picking up under a phase-contract
microscopy. The glomeruli were spun down, homogenized in 2-DE
lysis buffer and incubated. 1.
Genetic diversity in food legumes of Pakistan as revealed through
characterization, evaluation and biochemical markers 2.
Visualization and Correction of Prokaryotic Taxonomy Using Techniques
from Exploratory Data Analysis There are, at present,
over 5,700 named prokaryotic species. There has long been a
need to organize these species within a comprehensive taxonomy
that relates each species to all the others. For some years,
researchers have been sequencing the small subunit ribosomal
RNA genes of many prokaryotes, initially to try and establish
the evolutionary relationships among all prokaryotes and subsequently
in order to aid in the identification of prokaryotes both known
and unknown. These sequences have become an almost universal
feature in the description of new species. Thus, for the purposes
of classification, the sequences are probably the most useful,
universally described characteristic of the prokaryotes. Small
subunit rRNA gene sequences were used by the staff of the Bergeys
Manual Trust to establish prokaryotic taxonomy above the Family
level only recently. This effort was facilitated by the application
of techniques drawn from the field of exploratory data analysis
to visualize the evolutionary relationships among large numbers
of sequences and, hence, among the organisms they represent.
We describe the techniques used to develop the first maps of
sequence space and the techniques we are currently using to
ease the placement of new organisms in the taxonomy and to uncover
errors in the taxonomy or in sequence annotation. A key advantage
of these techniques is that they allow us to see and use the
complete data set of over 9,200 sequences. We also present plans
for the development of a tool that will allow all interested
researchers to participate in the maintenance and modification
of the taxonomy.
Quantitative information
on the types of inter-atomic interactions at the MHC-peptide
interface will provide insights to backbone/sidechain atom preference
during binding. Protein crystallographers have documented qualitative
descriptions of such interactions in each complex. However,
no comprehensive report is available to account for the common
types of inter-atomic interactions in a set of MHC-peptide complexes
characterized by MHC allele variation and peptide sequence diversity.
The available x-ray crystallography data for MHC-peptide complexes
in the Protein Databank (PDB) provides an opportunity to identify
the prevalent types of inter-atomic interactions at the binding
interface. The prevalently dominant SB interaction at the interface suggests the importance of peptide backbone conformation during MHC-peptide binding. Currently available algorithms are well developed for protein side chain prediction upon fixed backbone templates. This study shows the preference of backbone atoms in MHC-peptide binding and hence emphasizes the need for accurate peptide backbone prediction in quantitative MHC-peptide binding calculations.
Eukaryotes have both intron-containing and intron-less genes and their proportion varies from species to species. Most eukaryotic genes are multi exonic with their gene structure being interrupted by introns. Introns account for a major proportion in many eukaryotic genomes. For example, the human genome is proposed to contain 24% introns and only 1.1% exons (Venter et al. 2001). Although most genes in eukaryotes contain introns, there are a substantial number of reports on intronless genes. We recently created a database (SEGE) for intronless genes in eukaryotes using GenBank 128 sequence data (http://intron.bic.nus.edu.sg/seg/). The eukaryotic subdivision files from GenBank were used to create a dataset containing entries that are reservedly considered as single exonic genes according to the CDS FEATURE convention. Single exon genes with prokaryotic architectures are of particular interest in gene evolution. Our analysis on this set of genes shows that structures are known for nearly 14% of their gene products. The characteristics and structural features of such proteins are discussed in this presentation. Reference
1.
Shell Biodiversity Using Animation Technology
In the frog dissection
system (http://ruby.kisti.re.kr/~museumfs), virtual dissection
is enabled in order to eliminate these undesired effects and
the factuality of organs is disguised using Photoshop to minimize
the dislike of and aversion of students to the dissection process.
In addition, the system was designed in such a way that, once
a student replaces the dissected organs after observation is
done, a frog is reanimated and jumps around so that the student
does not treat the subject without care but instead treats it
with respect for its life. Antarctica, the
southernmost continent is a landmass of around 1.36 million
square kilometers 98 percent covered by ice up to 4.7 kilometers
thick. The continent remained neglected for decades after discovery,
scientific research was initiated in early 1940s. Two species
of phanerogams have been reported, whereas most of studies are
carried out on cryptogams like algae, lichens and bryophytes.
There are 700 species of terrestrial and aquatic algae in Antarctica,
250 lichens and 130 species of bryophytes including100 species
of mosses and 25-30 species of liverworts. The species composition
and abundance are controlled by many environmental variables,
such as nutrients, availability of water and increased ultraviolet
radiation resulting from the depletion of the ozone hole. These
cryptogams can be found in almost all areas capable of supporting
plant life in Antarctica and exhibit a number of adaptations
to the Antarctic environment. There is a need to apply molecular
and cellular techniques to study biodiversity and genetic characteristics
of flora of this region. Biochemical techniques including DNA
sequencing and microsatellite markers are being used to obtain
information about the genetic structure of plant populations.
These analyses are designed to assess levels of biodiversity
and to provide information on the origin, evolutionary relationships
and dispersal patterns. Flora of Antarctica needs to be genetically
evaluated for the characters related to survival in that unique
environment that can be incorporated into the economically important
plants using transformation. 4.
Automatic Mapping and Monitoring of Invasive Alien Plant Species,
the South African Experience Chinese Biodiversity
Information System (CBIS) is a nation-wide distributed information
system that collects, arranges, stores and disseminates data/information
refers to biodiversity in China. It consists of a center system,
5 disciplinary divisions and dozens of data source. The Center
System of CBIS is located in the Institute of Botany, Chinese
Academy of Sciences, Beijing. The 5 divisions are Zoological
Division (in Institute of Zoology, CAS, Beijing), Botanical
Division (in Institute of Botany), Microbiological Division
(in Institute of Microbiology, CAS, Beijing), Inland Wetland
Biological Division (in Institute of Hydrobiology, CAS, Wuhan)
and Marine Biological Division (in South China Sea Institute
of Oceanology, CAS, Guangzhou). The data sources cover 15 institutes
in CAS and includes botanical garden, field research station,
museum, cell bank, seed bank, culture collection and research
group. The Center System is response for building up and maintaining
integrated and national-scale biodiversity database, environmental
factor and vegetation database, model base and expert system
in ecosystem level, and platform and tools of modeling and expert
system. The Disciplinary Divisions are response for building
up and maintaining database, model base and expert systems on
their fields focused on data and information of species level.
Data Sources are response for building up and maintaining database
based on their local situation and disciplinary character, combining
with GIS technology to present biodiversity information and
data both in table and graphics. In order to conserve
and protect the very rich biological resources that have evolved
in a unique natural environment, the government in Taiwan has
set up a special committee and assigned a government agency,
both at the cabinet level, to be in charge of planning and implementing
relevant programs, respectively. Convening Prospects of
Biodiversity, Biodiversity-1999 and Biodiversity in the 21st
Century symposia has been the main means of building the
national consensus to identify issues to be studied, which have
motivated scientists to initiate the challenging task with the
support of research funding from related agencies. There are
6 national parks, 18 nature reserves, 13 wildlife and 24 nature
protection areas, totally covering 12.2% of the land area. The
Policy Formulating Committee for Climate Changes has recommended
the enforcement of education on biodiversity (including all
levels of school and general public education), and formulated
the working plans on the national biodiversity preservation
and bioresources survey. The research programs in progress,
supported by the national funding, include surveys on species,
habitants, ecosystems and genetic diversities, long-term monitoring
of diversity, sustainable bioresource utilization and compilation
of flora of Taiwan. Increase in the number of scientific publications
and increased emphasis placed by news media show the increased
concern of both academic and public domains on biodiversity
issue. Besides, the material and information databases related
to the biological resources of various categories have been
established and revised regularly. The following bioscience
databases have been established in Taiwan: National plant genetic
resources information system, Multimedia databank of Taiwan
wildlife, Taiwan Agricultural Institute plant information system,
Distribution and resources of fishes in Taiwan, Herbaria at
many sites, Cell bank, Asian vegetable genetic resources and
seeds, Database of pig production, Registry of pure-bred swine,
Mating, furrowing, performance and transfer of ownership of
pure-bred swine, Food marketing information system database,
Food composition table in Taiwan, Database on heavy metals in
Taiwan soils, Greenhouse gases emission from agriculture, Global
change database generated in Taiwan.
1.
Unweaving regulatory networks: automated extraction from literature
and
3.
PIR Integrated Databases And Data-Mining Tools For Genomic And
Proteomic Research
4.
Extraction of Phylogenetic Information from Gene Order Data Molecular phylogeny is frequently inferred from comparisons of nucleic or amino acid sequences of a single gene or protein family from different organisms. It is now known that there are a number of difficulties with this approach, for instance, correct alignment of sequence data, biased base (or amino acid) compositions among species, rate variation among sites and/or species, mutational saturation, and long-branch attraction artifact. Thus, development of new methods that can produce a reliable phylogenetic tree is an important issue. Here we present a simple method of reconstructing branching orders among genomes based on gene transpositions. We demonstrate that the occurrence or absence of a gene transposition event could provide empirical evidence for branching orders, being in contrast to the phenetic approaches of overall similarity or minimum distance. This approach is applied to evolutionary relationships among the completely sequenced Gram-positive bacteria. The complete genomic sequence data allow one to search for the target gene transpositions at a comprehensive level.
|
||||
Last site update: 15 March 2003
|