Overview

iDigBio, short for Integrated Digitized Biocollections, functions as a national resource for the digitization and access of natural history collections in the United States. Established in 2011, its primary mission is to promote and support the creation of a comprehensive, freely available digital record of all biological collections. This initiative addresses the challenge of disparate data sources by aggregating specimen data from numerous institutions, making it accessible through a unified portal and a programmatic API.

The iDigBio API is designed for researchers, educators, and developers who require programmatic access to millions of digitized specimen records. These records encompass a wide range of biological taxa, including plants, animals, fungi, and microbes, along with associated metadata such as collection locality, date, collector, images, and genetic data. The API supports various query parameters, allowing users to filter data by scientific name, geographic coordinates, collection date, institution code, and other relevant fields. This capability is crucial for large-scale ecological studies, biodiversity assessments, conservation planning, and climate change research, where access to vast datasets is essential for identifying patterns and trends.

For example, an ecologist might use the iDigBio API to retrieve all records of a specific plant species across a particular geographic region over several decades to analyze shifts in distribution due to environmental changes. A museum curator could query the API to discover gaps in their own collection data or identify specimens held by other institutions that complement their research. The platform also supports educational initiatives by providing data for virtual field trips and citizen science projects, enabling broader engagement with biodiversity science. The developer experience notes indicate that the API's documentation, including examples, facilitates straightforward integration for diverse applications.

iDigBio is particularly valuable for projects requiring a broad scope of biological data without the need to individually access each contributing institution's database. Its focus on U.S. collections complements global initiatives like the Global Biodiversity Information Facility (GBIF), which aggregates biodiversity data worldwide. By centralizing access to this rich dataset, iDigBio reduces the technical overhead for researchers and promotes interoperability within the biodiversity informatics community. The platform's commitment to open data ensures that this valuable scientific resource remains freely available for public good and scientific advancement.

Key features

  • Specimen Data Access API: Programmatic interface for querying and retrieving millions of digitized natural history specimen records, including plants, animals, fungi, and microbes.
  • Comprehensive Metadata: Access to detailed information for each specimen, such as scientific name, taxonomy, collection locality (coordinates), collection date, collector names, institutional codes, and associated media (e.g., images).
  • Advanced Query Capabilities: Supports filtering data by various parameters including scientific names, higher taxonomy, geographic bounding boxes, political divisions, collection dates, and specific institution/collection codes.
  • Data Aggregation: Centralizes data from hundreds of natural history collections across the United States, providing a unified access point.
  • Open Data Policy: All data and services are freely available to the public, supporting open science principles and broad accessibility for research and education.
  • Data Quality and Curation: Provides tools and guidance for data providers to improve the quality, accuracy, and completeness of digitized specimen data.
  • Virtual Field Trips: Offers resources and data to create virtual educational experiences based on natural history collections.
  • Community Support: Actively engages with the biodiversity informatics community, offering workshops, training, and resources for data digitization and utilization.

Pricing

As of May 2026, iDigBio operates under an open access model, meaning all its services and data are provided free of charge to all users. There are no subscription fees, usage-based charges, or premium tiers for accessing the API or the data portal.

Service Tier Features Cost (USD)
All Services Full API access, data portal access, educational resources, community support Free

For detailed information on accessing the API and data, refer to the official iDigBio API documentation.

Common integrations

The iDigBio API is typically integrated into scientific workflows and applications that require large-scale biodiversity data. Common integration patterns include:

  • Ecological Modeling Platforms: Researchers integrate iDigBio data into environmental niche modeling software (e.g., Maxent, R packages like 'dismo') to predict species distributions based on environmental variables and occurrence records.
  • Geographic Information Systems (GIS): Data from iDigBio can be imported into GIS software (e.g., ArcGIS, QGIS) for spatial analysis, mapping species occurrences, and visualizing biodiversity patterns. The ArcGIS REST API documentation provides examples of how spatial data can be queried and visualized.
  • Data Visualization Tools: Developers use iDigBio data with libraries like D3.js or mapping frameworks (e.g., Leaflet.js, Google Maps Platform) to create interactive maps and visualizations of specimen distributions.
  • Research Databases and Portals: Other scientific data portals or institutional repositories may integrate iDigBio data to enrich their own datasets or provide broader access to specimen information.
  • Educational Applications: Data is used in educational software or online modules to teach students about biodiversity, ecology, and data science through real-world examples.
  • Bioinformatics Pipelines: Integration into custom scripts and bioinformatics pipelines for tasks such as data cleaning, taxonomic reconciliation, and linking specimen data with genetic sequence information.

Alternatives

  • GBIF: The Global Biodiversity Information Facility is an international network and data infrastructure providing free and open access to biodiversity data worldwide.
  • VertNet: A collaborative project that makes biodiversity data available online, primarily focusing on vertebrate collections.
  • BioPortal: A repository of biomedical ontologies and terminologies, offering tools to manage and access structured biological knowledge.

Getting started

To get started with the iDigBio API, you can make HTTP GET requests to its RESTful endpoints. The primary endpoint for querying records is /records. Here's a basic example using Python's requests library to fetch records for a specific scientific name.

import requests
import json

# Base URL for the iDigBio API
BASE_URL = "https://api.idigbio.org/v2"

# Define query parameters
# This example searches for records of 'Panthera tigris' (Tiger)
params = {
    "rq": "scientificname:Panthera tigris",
    "limit": 5  # Limit to 5 records for brevity
}

# Make the API request
try:
    response = requests.get(f"{BASE_URL}/records", params=params)
    response.raise_for_status()  # Raise an exception for HTTP errors

    # Parse the JSON response
    data = response.json()

    # Print some details from the retrieved records
    print(f"Found {data['itemCount']} records for Panthera tigris.")
    print("--- First 5 Records ---")
    for record in data['items']:
        uuid = record.get('uuid', 'N/A')
        scientific_name = record.get('data', {}).get('dwc:scientificName', 'N/A')
        locality = record.get('data', {}).get('dwc:locality', 'N/A')
        institution_code = record.get('data', {}).get('dwc:institutionCode', 'N/A')

        print(f"UUID: {uuid}")
        print(f"  Scientific Name: {scientific_name}")
        print(f"  Locality: {locality}")
        print(f"  Institution: {institution_code}")
        print("-----------------------")

except requests.exceptions.RequestException as e:
    print(f"API Request failed: {e}")
except json.JSONDecodeError:
    print("Failed to decode JSON response.")

This Python script demonstrates how to construct a query for a specific scientific name and parse the JSON response to extract key information from the returned specimen records. The rq parameter supports a powerful query language, allowing for complex searches across various data fields. For more advanced queries and detailed parameter descriptions, consult the iDigBio API documentation.