-
Notifications
You must be signed in to change notification settings - Fork 2
Description
Aims
Download & clean up the CDC's data
Note the number of weeks in each year (yes, in CDC terms we can have more or less than 52 weeks a year!?!)
Note that seasonal H1N1 and pandemic H1N1 are considered different strains, please assign their lines separate colors in all graphs
Note other messy things about the data and decide what to do about it
Generate graphs
- flu ILI
- Subtype isolate graphs
- for the 1st draft, please show all the age groups together
- if you have more time, feel free to separate the age groups
- there is some interesting stuff to observe from these
Background
Influenza Like Illness (ILI) is a good readout of relative disease severity caused by influenza viruses from year to year. Normally, H3N2 is associated with more severe seasons than H1N1. However, pandemics are also associated with more disease severity.
Looking at graphs of the CDC data can help select sets of viruses to develop models that answer different questions related to predicting viral evolution.
I made some Jupyter Notebooks to process and graph the CDC data on ILI early in my journey to become a data scientist. My first notebook explains how to get the data (among my other notes as I was learning) and my second notebook, in Out[15], shows a graph of ILI. These will provide a good starting point. If I were you I'd start with a fresh, empty notebook :)