Principal Data Engineer
VillageMD
5d ago
0$170k - $195kDataUnited Stateshimalayas
Data-EngineeringBig-DataData-ArchitectureCloud-EngineeringBackend-EngineeringSenior
Job Description
About Our CompanyWe’re a physician-led, patient-centric network committed to simplifying health care and bringing a more connected kind of care.Our primary, multispecialty, and urgent care providers serve millions of patients in traditional practices, patients' homes and virtually through VillageMD and our operating companies Village Medical, Village Medical at Home, Summit Health, CityMD, and Starling Physicians.When you join our team, you become part of a compassionate community of people who work hard every day to make health care better for all. We are innovating value-based care and leveraging integrated applications, population insights and staffing expertise to ensure all patients have access to high-quality, connected care services that provide better outcomes at a reduced total cost of care.Please Note: We will only contact candidates regarding your applications from one of the following domains: @summithealth.com, @citymd.net, @villagemd.com, @villagemedical.com, @westmedgroup.com, @starlingphysicians.com, or @bmctotalcare.com.Job DescriptionEMPLOYER: Village Practice Management Company, LLC DBA VillageMDJOB POSITION: Principal Data EngineerLOCATION: 1 Diamond Hill Road, Berkeley Heights, NJ 07922PAY RANGE: $169,541 – $195,000 per yearDUTIES: Secure data sources, design data models, and build distributed components, pipelines, and tools that enable the organization to make analytical, data-driven decisions. Develop and automate large scale, high-performance data processing systems (batch and streaming) to drive business growth and improve product experience. Build scalable data pipelines leveraging orchestration technologies. Design our data models for optimal storage and retrieval and to meet critical product and business requirements. Understand and influence logging to support our data flow, architecting logging best practices where needed. Contribute to shared Data Engineering tooling & standards to improve the productivity and quality of output for Data Engineers across the company. Improve data quality by using & improving internal tools to automatically detect issues. Design and implement data pipelines and data summarization processes using SQL and apply understanding of relational data structures. Create a summary data platform supporting our presentation layer that allows clinicians and operators in our practices to pinpoint interventions on-demand to patients most in need. Locate trends in data sets and build algorithms for large and complex data sets. Analyze and design the best ways to expand our data model to incorporate more data that is mission critical. Analyze large data sets to identify gaps and inconsistencies, provide data insights, and advance effective product solutions. Own projects to design and implement best-in-class data processing enabling clean data flow directly to employer’s data model and on to our presentation layer. Assist with development of systems, strategies, and standards to ensure the quality of the data collected, organize and maintain the data properly, and effectively deliver data needed by primary care physicians to manage patient healthcare. Understand technical and business processes required to identify and deliver appropriate and useful data to our organization. Work with analytics, engineering, and operations to design and implement new analytics products that support improving patient health. Follow data best practices, including engagement in project status meetings and self-education on new technologies and practices. Design a new concept within our data model to meet a new operational or analytical need. Tools/technologies used: Python, Scala, pytest, Java, Axis, Flask, Spark, Jupyter Notebook, Healthcare Provenance, Matillion, ETL, dbt Core, dbt Cloud, CI-CD (continuous integration/continuous delivery), GitHub Action workflows, AWS - S3, SQS, SNS, EventBridge, Schedulers, eCR, Workspace, ReactJS, Talend, athenaOne, Healthjump, PyDeequ. Position may telecommute from anywhere in the U.S.
REQUIREMENTS: This position requires at least a Bachelor’s degree in Computer Science, Computer Engineering, or a closely related field and at least five (5) years of post-baccalaureate, progressive experience in software or application development (any title) including at least three (3) years of experience in data engineering deploying and managing data pipelines, managing clinical and claims data pipelines, using Python, Amazon Web Services (AWS) S3, SQS, and SNS, ETL data, GitHub, Scala, Java, Spark, and EventBridge. Position may telecommute from anywhere in the U.S.For Colorado Residents only: This is an exempt position. The base compensation range for this role is $169,541 – $195,000. At VillageMD, compensation is based on several factors including but not limited to education, work experience, certifications, location, etc. The selected candidate will be eligible for a valuable company benefits plan, including health insu
