Clinical and translational research data often exist across different sources—including electronic health record (EHR), electronic data capture (EDC), and biobank systems—and require custom integration to become amenable to analysis. By bringing together data from disparate applications, research data repositories (RDRs) enable investigators to ask multiple research, quality, and clinical questions. RDRs provide authorized personnel with access to self-service tools for discovery (i.e. generating hypotheses), collection (i.e. recording novel measures), and preparation (e.g. testing hypotheses) of data to support retrospective and prospective studies. Creation of an RDR requires approval from the ARCH Scientific Advisory Board as well as a financial commitment from investigators.