In order to understand the epidemic and make predictions about it, researchers require quality data about cases of the disease. While many services use newspaper articles or tweets to estimate Ebola outbreak severity, true indicence data broken down by date is the most useful. However, much of this information is provided as PDF format reports, which are very difficult to import into statistical software. By digitizing the data into spreadsheets, we are providing data in a format that can be easily used for analysis.
Who and Where?
Caitlin Rivers, a PhD student in computation epidemiology at the Virginia Tech Network Dynamics and Simulation Science Lab, has compiled a fantastic GitHub repository for Ebola incidence and case data. GitHub is an open source hosting service for projects, including data, documents, and code.
We will be digitizing reports from Guinea.