Data Engineering Research Assistant - Healthcare Pricing Data Infrastructure
General qualifications for the position:
- Must have strong experience with SQL and relational database design (e.g., PostgreSQL or MySQL)
- Must be proficient in Python/R for data ingestion and transformation (e.g., using pandas, json, sqlalchemy)
- Must be comfortable working with large and complex datasets, especially in structured and semi-structured formats (e.g., JSON, CSV)
- Must be able to design and implement a clean, scalable database schema, and populate it with data extracted from public sources
- Strong attention to detail and ability to work independently
ยทย Preferred (not required):
- Interest in health data, informatics, or policy analytics
- Experience with cloud-based data platforms (e.g., AWS Athena, Google BigQuery) or big data tools (e.g., PySpark)
- Familiarity with data visualization or exploratory analysis (e.g., using Python, Tableau, or Power BI)
- Preference will be given to students who started working towards their masters degree in Spring 2026
10-15 hours per week