New Mexico Supercomputing Challenge

Model of Disease Propagation to Predict the Speed of an Outbreak

Team: 6

School: Albuquerque Academy

Area of Science: Medicine and Computer Science


Interim: 
1. Definition of the Problem


Influenza is an infectious disease that spreads through a population via minscule droplets of infected saliva that are released from the infected person’s mouth and nose. This means that it can be spread from a distance, so it can be easily and rapidly transported from one person to the next. Influenza can be dangerous when it gets out of hand, and in some cases deadly. A way to predict the speed and scale of an outbreak would help medical authorities be prepared for an outbreak with sufficient materials and an idea of how long it could take for the virus to go out of control.


2. Plan for Computationally Solving the Problem

We have chosen to investigate the propagation of a disease. We will accomplish this using computer modeling to predict how fast a disease will propagate through a selected region. This is important because areas that are prone to disease can know how to prepare for potential outbreaks with the most effective measures. We plan to model this problem as effectively as possible. We will create this model using C++, with numerous variables to show how fast the virus will spread. We will complete our task by dividing the work equally and using medical data on previous outbreaks, population and transportation maps, and information on how the disease spreads to create a mathematical model, then a full-fledged program. Our program will utilize an Agent-based modeling system with multiple Agents each representing a person. These Agents will move on a semi random path over a grid. When two or more Agents, of which one at least is infected, are within a suitable distance of each other, the program will use each Agent’s properties to calculate the likelihood of the uninfected Agent catching influenza from the infected Agent. These properties will include immunity and the amount of time an infected Agent will take to get well, among others.


3. Description of the Progress Made to Date

Presently, an initial C++ OpenGL virus predictor has successfully been created.  This code contains an Agent Factory where each Agent is an instance of a C++ struct.  A mock-up database of Agents is read into each Agent's struct from an ASCII flat file that was created.  Initial condition attributes utilized within the C++ code for each Agent include the following: name, contagious (yes/no), normalized latitude and longitude coordinates, health (1-10), and time-to-get-well.  The attribute not yet used is travel, however currently a random number generator is independently used to model the movement of each Agent at each time iteration.  When the program starts, the mock-up database is read and Agents are created with their initial conditions and their Lat/Lon positions are displayed in a normalized coordinate plane within an OpenGL window.  Infected Agents are red symbols and healthy Agents are green.  As the simulation runs, the Agents' positions are visually displayed as red or green symbols.  Each simulation step is in the milliseconds and each Agent can move only a few steps.  At each increment of time an infected Agent may move and touch (or get close to) a healthy Agent, when that occurs, the health of the healthy Agent goes down.  When an Agent's health is zero, it gets infected.  Each Agent has their own stop watch where after a period of time they get well.  No consideration has been given yet to Agents who do not survive.  During this simulation, Agents can be seen moving around the OpenGL window, some of which are infected (red), others which are healthy (green).

Other progress attained is the identification of raw data which was downloaded in ASCII file format containing Influenza Virus Databases from the Center for Disease Control and Prevention (CDC) and the Influenza Research Database (IRD) [Squires et al. (2012) Influenza research database: an integrated bioinformatics resource for influenza research and surveillance. Influenza and Other Respiratory Viruses DOI: 10.1111/j.1750-2659.2011.00331.x. ] The raw data obtained was entered into the database on a weekly basis. This is the granularity of updates observed so far. This initial data has been compiled and converted into Microsoft Excel plots for initial analysis. Additionally, we have identified data on transportation. This may be used to determine where an Agent may travel. Our mathematical model is currently under progress, and is waiting for the data to be compiled. It will calculate the probability of infected Agents spreading their infection.


4. Expected Results

The goal of this project is to create a rule based mathematical model of the spread of disease through a community. The program will use an Agent based modeling system to determine the likelihood of each Agent’s infection, and model the spread from Agent to Agent over a certain amount of time using the probabilities we have calculated. It would then be able to state how many Agents have become infected after a certain amount of time has passed. We expect, when our efforts are compiled and amalgamated into a final result, a functional program should predict an outbreak of influenza within a given error. It will output using symbols on a graphical display, with colors to represent their state of infection. The program will utilize medical information about influenza, as well as data on past epidemics and travel data. It will use a mathematical model to determine the probability of infected Agents infecting uninfected Agents.


5. Citations

Crochemore, M., Hancart, C., Lecroq, T.: Algorithms on strings. Cambridge University Press, Cambridge (2007)
Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: A review. ACM Computing Survey 31, 264–323 (2000)
National Center for Biotechnology Information (NCBI): Influenza virus resources (2008),
Russell, C.A., Jones, T.C., Barr, I.G., Cox, N.J., Garten, R.J., et al.: The global circulation of seasonal influenza A (H3N2) viruses. Science 320, 340–346 (2008)
Zhang T, Soh SH, Fu X, Lee KK, Wong L, Ma S, Xiao G, Kwoh CK. HPCgen a fast generator of contact networks of large urban cities for epidemiological studies. International Conference on Computational Intelligence, Modeling and Simulation. 2009. pp. 198–203.
Martín G, Marinescu MC, E Singh D, Carretero J. EpiGraph: a scalable simulation tool for epidemiological studies. The 2011 International Conference on Bioinformatics and Computational Biology. 2011. pp. 529–536.
Keeling MJ, Eames KT. Networks and epidemic models. J R Soc Interface. 2005;2(4):295–307. doi: 10.1098/rsif.2005.0051.
Newman M. Spread of epidemic disease on networks. Phys Rev E Stat Nonlin Soft Matter Phys. 2002;66:016128.
Read J, Eames K, Edmunds W. Dynamic social networks and the implications for the spread of infectious disease. J R Soc Interface. 2008;5(26):1001. doi: 10.1098/rsif.2008.0013.
Coelho FC, Cruz OG, Codeco CT. Epigrass: a tool to study disease spread in complex networks. Source Code Biol Med. 2008;3:3. doi: 10.1186/1751-0473-3-3.
Chung F, Lu L. Connected components in random graphs with given expected degree sequences. Annals of Combinatorics. 2002;6:125–145. doi: 10.1007/PL00012580.
Volz E, Meyers L. Susceptible-infected-recovered epidemics in dynamic contact networks. Proc Biol Sci. 2007;274(1628):2925. doi: 10.1098/rspb.2007.1159.
Mossong J, Hens N, Jit M, Beutels P, Auranen K, Mikolajczyk R, Massari M, Salmaso S, Tomba G, Wallinga J. et al. Social contacts and mixing patterns relevant to the spread of infectious diseases. PloS Med. 2008;5(3):e74. doi: 10.1371/journal.pmed.0050074.
Bian L. A conceptual framework for an individual-based spatially explicit epidemiological model. Environment and Planning B. 2004;31(3):381–396. doi: 10.1068/b2833.
Mao L, Bian L. Spatial-temporal transmission of influenza and its health risks in an urbanized area. Computers, Environment and Urban Systems. 2010;34(3):204–215. doi: 10.1016/j.compenvurbsys.2010.03.004.
Carley K, Fridsma D, Casman E, Yahja A, Altman N, Chen LC, Kaminsky B, Nave D. BioWar: scalable agent-based model of bioattacks. IEEE Transactions on Systems, Man and Cybernetics. 2006;36(2):252–265.
Eichner M, Schwehm M, Duerr HP, Brockmann S. The influenza pandemic preparedness planning tool InfluSim. BMC Infect Dis. 2007;7:17. doi: 10.1186/1471-2334-7-17.
Miritello G, Moro E, Lara R. Dynamical strength of social ties in information spreading. Phys Rev E Stat Nonlin Soft Matter Phys. 2011;83(4):045102.
Longini IM, Halloran EM, Nizam A, Yang Y. Containing pandemic influenza with antiviral agents. Am J Epidemiol. 2004;159(7):623–633. doi: 10.1093/aje/kwh092. 



6. Team Members:

Dylan Hendrickson, Marcus Dominguez-Kuhne, Craig Short, Maxwell Johnson


7. Sponsoring Teacher:

Mr. Jim Mims.



Team Members:

  Max Johnson
  Marcus Dominguez-Kuhne
  Dylan Hendrickson
  Craig Short

Sponsoring Teacher: Jim Mims

Mail the entire Team

For questions about the Supercomputing Challenge, a 501(c)3 organization, contact us at: consult @ challenge.nm.org

New Mexico Supercomputing Challenge, Inc.
Post Office Box 30102
Albuquerque, New Mexico 87190
(505) 667-2864

Supercomputing Challenge Board of Directors
Board page listing meetings and agendas
If you have volunteered for the Challenge, please fill out our In Kind form.