The NetHealth Project is a multi-year longitudinal study of social networks and health funded by the National Institutes of Health and the National Health, Lung and Blood Institute (5R01HL117757-04).  The project collected a wide range of data from a cohort of Notre Dame undergraduate students who entered Notre Dame in the Fall of 2015 and graduated in the Spring of 2019. Around 700 students were initially recruited for a two-year study, which was extended to 4 years with around 300 of these students.  The study involved the collection of a wide range of data including information on each participant’s social networks, physical activity and sleep obtained from Fitbit devices, and physical and mental health, social-psychological states, tastes and various self-reported behaviors, and demographics and background traits through surveys.   

Through this website we are releasing a good deal of the data that was collected.  This website documents and provides access to the various data sets that we are releasing . 

This data is confidential and was obtained with informed consent from study participants through approved IRB protocols designed to ensure confidentiality and maintain the security of the data.  We have implemented a case identification system that anonymizes the data while allowing researchers to link together data from different sources. Details on this system are in the Data Linkage and Identification section under the Data Collection heading where we also provide detailed descriptions on each of these data collection efforts and resulting datasets, as well as information on the Sample:

  • Communication Data:  Event (time-stamped) records of all communication events logged on a study participant’s smartphone.  This is the information used to construct social networks (arc and edge lists) of participant’s communicative interactions with other people. 
  • Basic Survey Data:  8 waves of data collected through surveys completed online by participants every 3-4 months.  
  • Network Survey Data:  8 waves of data collected through surveys every 3-4 months in which students were asked to name and provide information on up to 20 people with whom they had interacted.  
  • Fitbit Data (Daily Activity and Daily Sleep Data):  Daily data on a variety of measures computed from Fitibt sensors such as number of steps, active minutes,  average daily heart rate, minutes in various heart rate zones, beginning and ending times of sleep periods, sleep duration, and sleep interruptions.
  • Courses and Grades Data:  For a select sample of participants who consented, we have administrative records from the Notre Dame Registrar’s Office containing course and grade information on each course taken by a consenting student during their four years at Notre Dame. 
  • Calendar:  The data collection lasted 194 weeks.  We provide a weekly calendar for researchers detailing when classes began, break weeks, holidays, etc.  

Collecting, managing and cleaning all this data was a large and complicated task involving many people including engineers, computer scientists, and social scientists over a five year people (see People section).  One novel aspect of this study is the collection of both self-reports on networks and behaviors through periodic surveys and the collection of behavioral data through remote, always-on mobile sensors, such as Fibtit devices and smartphones.   All participants were given a Fitbit device which they periodically synced with the Fitbit cloud from which we then retrieved with an API the Fitbit data and stored on our server. All participants installed apps on their smartphones and computers that allowed us to retrieve daily logs of communication events.  Through these apps, sensor output data was retrieved from smartphones and Fitbit devices and sent to our secure servers for processing and storage in a database system.

Details on the documentation are discussed in the Documentation section and include:

  • Codebooks for each dataset
  • Tables with information on codes and scales employed
  • Supplemental Material with detailed information on various aspects of the data collection.