Skip to main content
Taylor & Francis Group Logo
    Advanced Search

    Click here to search products using title name,author name and keywords.

    • Login
    • Hi, User  
      • Your Account
      • Logout
      Advanced Search

      Click here to search products using title name,author name and keywords.

      Breadcrumbs Section. Click here to navigate to respective pages.

      Chapter

      Biological Databases and Integration
      loading

      Chapter

      Biological Databases and Integration

      DOI link for Biological Databases and Integration

      Biological Databases and Integration book

      Biological Databases and Integration

      DOI link for Biological Databases and Integration

      Biological Databases and Integration book

      BySumeet Dua, Pradeep Chowriappa
      BookData Mining for Bioinformatics

      Click here to navigate to parent product.

      Edition 1st Edition
      First Published 2012
      Imprint CRC Press
      Pages 40
      eBook ISBN 9780429122217
      Share
      Share

      ABSTRACT

      This chapter describes the intricacies involved in handling prevalent databases used in bioinformatics. The evolutionary nature of the biological data renders unique characteristics that are describes as highly heterogeneous, large in data volume, dynamic, hierarchical, not standardized, lacking database management applications and data access tools for biological databases, and data integration and annotation. The categorization aims to differentiate biological databases into two categories, systems point solution and general solution databases. Gene expression data, raw data are obtains in the form of microarray chip images, a product of the microarray experiment. The Protein Data Bank (PDB) is one of the largest repositories of known protein structures. The inherent large number of dimensions, called the curse of dimensionality, has ubiquitous effects throughout the sciences, specifically in bioinformatics. In multisource integration, the problems faced are derivatives of the problems of each independent source. Data cleaning that uses domain knowledge to duplicate record identification and for de–duplication is a necessary component of data preprocessing.

      T&F logoTaylor & Francis Group logo
      • Policies
        • Privacy Policy
        • Terms & Conditions
        • Cookie Policy
        • Privacy Policy
        • Terms & Conditions
        • Cookie Policy
      • Journals
        • Taylor & Francis Online
        • CogentOA
        • Taylor & Francis Online
        • CogentOA
      • Corporate
        • Taylor & Francis Group
        • Taylor & Francis Group
        • Taylor & Francis Group
        • Taylor & Francis Group
      • Help & Contact
        • Students/Researchers
        • Librarians/Institutions
        • Students/Researchers
        • Librarians/Institutions
      • Connect with us

      Connect with us

      Registered in England & Wales No. 3099067
      5 Howick Place | London | SW1P 1WG © 2022 Informa UK Limited