Data management and analysis

Photo by Mika Baumeister on Unsplash

These standalone data management and analysis projects are open to volunteers who want to gain experience with real-world data. Some projects may also result in authorship on a publication.

Current opportunities available

  1. Merge DETECT data with APS data

    • I’m looking for someone who can help me link records across two administrative datasets that don’t share a unique identifier. Instead, we will need to probabilistically match people across datasets based on name, date of birth, and address. We will likely use R’s RecordLinkage package, R’s fastLink package, or Python’s dedupe library to accomplish this.
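To give a rough sense of what matching without a shared identifier involves, here is a minimal sketch using only Python's standard library. It is not the RecordLinkage, fastLink, or dedupe API, and it does not fit the statistical models those packages use; it only scores candidate pairs with a weighted similarity. The field names, weights, threshold, and sample records are all hypothetical and are not taken from the DETECT or APS data.

```python
from difflib import SequenceMatcher

# Hypothetical field weights and cutoff; real values would be estimated or tuned.
WEIGHTS = {"name": 0.4, "dob": 0.4, "address": 0.2}
THRESHOLD = 0.85


def normalize(text):
    """Lowercase and collapse whitespace so trivial differences are ignored."""
    return " ".join(text.lower().split())


def field_similarity(field, a, b):
    """Return a similarity between 0 and 1 for one field of two records."""
    if field == "dob":
        # Dates are compared exactly in this sketch.
        return 1.0 if a == b else 0.0
    # Names and addresses are compared with a fuzzy string ratio.
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def match_score(rec_a, rec_b):
    """Weighted sum of the per-field similarities."""
    return sum(w * field_similarity(f, rec_a[f], rec_b[f]) for f, w in WEIGHTS.items())


def link_records(left, right, threshold=THRESHOLD):
    """For each left record, keep its best-scoring right record if it clears the threshold."""
    links = []
    for i, rec_a in enumerate(left):
        scored = [(match_score(rec_a, rec_b), j) for j, rec_b in enumerate(right)]
        if not scored:
            continue
        best_score, best_j = max(scored)
        if best_score >= threshold:
            links.append((i, best_j, round(best_score, 3)))
    return links


# Made-up records standing in for the two datasets.
detect_records = [
    {"name": "Jane A. Doe", "dob": "1941-03-07", "address": "12 Oak St, Dallas TX"},
    {"name": "Robert Smith", "dob": "1938-11-22", "address": "400 Elm Ave, Houston TX"},
]
aps_records = [
    {"name": "Bob Smyth", "dob": "1950-01-01", "address": "9 Pine Rd, Austin TX"},
    {"name": "Jane Doe", "dob": "1941-03-07", "address": "12 Oak Street, Dallas TX"},
]

links = link_records(detect_records, aps_records)
print(links)  # each tuple is (left index, right index, score)
```

Comparing every pair like this scales poorly, which is one reason dedicated packages restrict comparisons to plausible candidate pairs and estimate match weights from the data rather than fixing them by hand.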
Brad Cannell
Associate Professor of Epidemiology

Dr. Cannell is an epidemiologist at one of the largest health science centers in the world, where he uses data-driven approaches to improve the lives of older adults and the health of the communities they live in.