Introduction

This project focuses on creating a secure and scalable infrastructure to support data acquisition, transformation, visualization and analysis for the Southeast Regional Drug Data Research Center (SR-DDRC) at the University of Alabama Institute of Data and Analytics. The primary goal of the technical aspect of this initiative is to collect and analyze demographic, geographic, socioeconomic and other social determinants of health data relevant to drug and opioid crisis to better inform future policy and intervention. The data gathered in this effort will also support future research into various areas related to drug and opioid misuse.

This document serves as a technical guide for other research institutions interested in replicating portions of this project in different geographic areas or research areas. It provides and overview of the technical infrastructure, including the setup of secure network environments, server infrastructure, data acquisition methods, and data transformation workflows. The document also outlines the tools and best practices used to ensure data quality, security and scalability. 

Conclusion

In this technical assistance document, we have outlined the comprehensive framework used for our data acquisition, transformation, and visualization project. Starting with a secure infrastructure, we discussed the methods of data acquisition from various sources, emphasizing our focus on raw text files and the importance of regular updates and versioning. The subsequent data cleaning and transformation processes, utilizing T-SQL, Python, and PowerShell, ensure data quality and integrity, supported by robust validation techniques and visual inspections through Power BI. We detailed the structure of our final production tables, highlighting their optimization for performance, and elaborated on our deployment strategies using Azure DevOps and Git for version control. Finally, ongoing maintenance practices, including monitoring and regular updates, are essential to maintaining a reliable system. This document serves as a blueprint for other research institutions aiming to replicate our efforts in data management and analysis, with further resources available on our website for specific processes.