Sources
Data (updated: June-2022)
-
Project Open Data Dashboard: a website that lets Federal agencies, industry, the general public, and other stakeholders view how Federal agencies are progressing on implementing M-13-13, Open Data Policy: Managing Information as an Asset.
-
getTBinR: Access and Summarise World Health Organization Tuberculosis Data.
-
bomrang: Australian Government Bureau of Meteorology (BOM) Data Client.
-
tidycensus: allows users to interface with a select number of the US Census Bureau’s data APIs and return tidyverse-ready data frames.
-
Public Health England’s Fingertips, and a corresponding package fingertipsR.
-
mapsapi: Google Maps APIs. Credits are needed (https://cloud.google.com/edu/faculty).
-
mozzie: Weekly notified dengue cases in Sri Lanka (2008 to 2014). For later years, data are available from the Epidemiology Unit, Ministry of Health.
-
Statistics Denmark, and a corresponding package statsDK.
-
Some introductions:
The Google Dataset Search Engine
Google unveils search engine for open data
Google Dataset Search: Building a search engine for datasets in an open Web ecosystem
-
Kaggle Data: Data science company.
-
NHANES: The National Health and Nutrition Examination Survey.
-
SEER: The Surveillance, Epidemiology, and End Results (SEER) Program provides cancer statistics of the U.S. population.
-
CDC WONDER: Wide-ranging ONline Data for Epidemiologic Research system provided by the Centers for Disease Control and Prevention (CDC).
-
HCUP: The Healthcare Cost and Utilization Project (HCUP), the largest collection of longitudinal hospital care data in the United States.
-
CSDR: Clinical Study Data Request, a consortium of clinical study sponsors. See also: An Industry Experience with Data Sharing.
-
EMA: Clinical data published under the European Medicines Agency (EMA) policy.
-
MIMIC: Medical Information Mart for Intensive Care.
-
Data.gov: U.S. Government’s open data.
-
GHDx: Global Health Data Exchange (GHDx), a catalog of global health and demographic data.
-
VAERS: Vaccine Adverse Event Reporting System (VAERS).
-
NEMSIS: The National Emergency Medical Services Information System.
-
ICPSR: Inter-university Consortium for Political and Social Research, a data archive of more than 250,000 files of research in the social and behavioral sciences.
-
Project Tycho 2.0: data of infectious disease epidemiology and global health informatics.
-
ViPR: Virus Pathogen Resource, a bioinformatics data source.
-
VEuPathDB: The Eukaryotic Pathogen, Vector and Host Informatics Resource (VEuPathDB).
-
ImmPort: Immunology Database and Analysis Portal.
-
IRD: Influenza Research Database.
-
FluNet: a global web-based tool for influenza virological surveillance, part of the Global Influenza Surveillance and Response System (GISRS).
-
GISAID: a global science initiative and primary source of genomic data on influenza viruses; a bioinformatics data source.
-
DHS: The Demographic and Health Surveys (DHS) Program.
Collections
-
ClinEpiDB: integrates data from high-quality epidemiological studies.
-
Scientific Data: a peer-reviewed open-access journal for descriptions of datasets and research.
Techniques
-
Official Kaggle Blog: “interviews from top data science competitors.”
-
rOpenSci: “rOpenSci fosters a culture that values open and reproducible research using shared data and reusable software.”
-
Spatial Point Patterns: Methodology and Applications with R, and a corresponding package spatstat.