Sources

Data (updated: June-2022)

  1. Project Open Data Dashboard: The Project Open Data Dashboard is a website enabling Federal agencies, industry, and the general public and other stakeholders to view details on how Federal agencies are progressing on implementing M-13-13 Open Data Policy—Managing Information as an Asset.

  2. getTBinR: Access and Summarise World Health Organization Tuberculosis Data.

  3. bomrang: Australian Government Bureau of Meteorology (BOM) Data Client.

  4. tidycensus: allows users to interface with a select number of the US Census Bureau’s data APIs and return tidyverse-ready data frames.

  5. Public Health England’s Fingertips, and a corresponding package fingertipsR.

  6. mapsapi: Google Maps APIs. Credits are needed (https://cloud.google.com/edu/faculty).

  7. mozzie: A weekly notified dengue cases in Sri Lanka (2008 to 2014). For later years, data are presented at Epidemiology Unit Ministry of Health.

  8. Statistics Denmark, and a corresponding package statsDK.

  9. Google Dataset Search.

    Some introductions:

    The Google Dataset Search Engine

    Google unveils search engine for open data

    Google Dataset Search: Building a search engine for datasets in an open Web ecosystem

  10. Kaggle Data: Data science company.

  11. NHANES: The National Health and Nutrition Examination Survey.

  12. SEER: The Surveillance, Epidemiology, and End Results (SEER) Program provides cancer statistics of the U.S. population.

  13. CDC WONDER: Wide-ranging ONline Data for Epidemiologic Research system provided by the Centers for Disease Control and Prevention (CDC).

  14. HCUP: The Healthcare Cost and Utilization Project (HCUP), the largest collection of longitudinal hospital care data in the United States.

  15. CSDR: Clinical Study Data Request, a consortium of clinical study Sponsors. An Industry Experience with Data Sharing.

  16. EMA: Clinical data published under the European Medicines Agency (EMA) policy.

  17. MIMIC: Medical Information Mart for Intensive Care.

  18. Data.gov: U.S. Government’s open data.

  19. GHDx: Global Health Data Exchange (GHDx), a catalog of global health and demographic data.

  20. VAERS: Vaccine Adverse Event Reporting System (VAERS).

  21. NEMSIS: The National Emergency Medical Services Information System.

  22. WHO data collections

  23. ICPSR: Inter-university Consortium for Political and Social Research, a data archive of more than 250,000 files of research in the social and behavioral sciences.

  24. HealthData.gov

  25. Project Tycho 2.0: data of infectious disease epidemiology and global health informatics.

  26. VIPR: Virus pathogen resource, a bioinformatics data source.

  27. VEuPathDB: The Eukaryotic Pathogen, Vector and Host Informatics Resource (VEuPathDB).

  28. ImmPort: Immunology Database and Analysis Portal.

  29. IRD: Influenza research database.

  30. FluNet: a global web-based tool for influenza virological surveillance. Global Influenza Surveillance and Response System (GISR).

  31. GISAID: a global science initiative and primary source, a bioinformatics data source.

  32. DHS: The Demographic and Health Surveys (DHS) Program

  33. Union Army Data

Collections

  1. zenodo

  2. NIH Data Sharing Repositories

  3. ClinEpiDB: ClinEpiDB integrate data from high quality epidemiological studies.

  4. Scientific data: a peer-reviewed open-access journal for descriptions of datasets and research

Techniques

  1. Official Kaggle Blog: “interviews from top data science competitors.

  2. rOpenSci: “rOpenSci fosters a culture that values open and reproducible research using shared data and reusable software.

  3. Spatial Point Patterns: Methodology and Applications with R, and a corresponding package spatstat

  4. Introduction to Functional Data Analysis with R

  5. Survey analysis in R

  6. Analyze Survey Data for Free

Previous