Hello! 👋

I'm Zaher

Healthcare Data Engineering — I make clinical quality methodology production-ready

About Me

I started in writing and editing, spent nearly a decade in mixed-methods health services research at UW-Madison starting in 2009, and moved into data engineering when I realized the tools I needed to study complex healthcare systems didn't exist yet. The thread across all of it is an interest in complex systems, not numbers for their own sake. The numbers and tools are just the best available instrument for whatever system I'm studying. Right now that means Medicare Star Ratings, HEDIS pipelines, and data integration across claims and eligibility sources. I build with Python and SQL and I care most about analytics that ship as products, not reports. I continue to engage with research questions at the edges of the field — my writing focuses on methodology problems in healthcare data that don't have clean answers yet.

Clinical Quality & Regulatory

HEDIS / Quality Measurement Value-Based Care (ACO/MSSP) Medicare Star Ratings HIPAA / Healthcare Data Governance

Data Engineering

SQL (Postgres, Redshift, SQL Server, MariaDB) Python (pandas, PySpark) dbt Bash AWS (S3, Glue) Azure Data Factory Databricks

Clinical Systems

Epic Clarity Analytics Cerner Analytics athenahealth Analytics Veradigm Analytics Clinical Data Integration (EHR/Claims)

Analytics & BI

Tableau Power BI Sisense

Statistics & Research Methods

SAS Stata Mixed Methods Research Survey Design Interrupted Time Series

By the Numbers

0+

Years Experience

0+

Organizations

0

Blog Posts

0+

Technical Skills

0+

Publications

0+

Presentations

Now

Right now I’m at Baltimore Health Analytics building a Medicare Advantage analytics platform, where I lead the data engineering and methodology. I’m focused on the places where CMS Technical Notes and production reality diverge — which turns out to be most of them.

Projects

01

Client-Side Stars Analytics Dashboard (Single-File HTML)

Standalone, in-browser dashboard implemented as a single HTML file using Chart.js 4.4.1, PapaParse 5.4.1, jsPDF 2.5.1, and chartjs-plugin-datalabels to visualize local CSV data and generate PDF reports with no server dependency and no data leaving the user's machine.

Chart.jsPapaParsejsPDFJavaScriptHTMLClient-Side Analytics
02

Lessons Learned from healthfinch's Charlie Practice Automation: A Case Study

Analytics used in case study at OCHIN, Inc. examining healthfinch's Charlie Practice Automation Platform implementation across multiple community health centers. Focus on workflow optimization and return on investment achieved through data-driven decision making.

Linear RegressionStatisticsSisenseEpic Clarity
03

Care Delivery Workflow Changes

Analyzed organization-wide care delivery changes using interrupted time series analysis on clinic panel data. Measured impact of change initiatives with segmentation and regression modeling.

StataSASTime Series AnalysisOutpatient Analytics
04

UW Health Patient Relations Survey Redesign

Redesigned primary care patient survey for UW Health Patient Relations/Resources department. Conducted pretesting with patient interviews to ensure survey validity and usability improvements.

Survey DesignPatient ExperienceHealthcare QualityQualtrics
05

Cancer Prevention in Mental Health Populations

Conducted research studying cancer prevention screening rates in patients with co-morbid severe mental illness using logistic regression and the Elixhauser Comorbidity Index.

Logistic RegressionPopulation HealthHealthcare DisparitiesSAS
06

Healthcare Workforce Transition Discovery Platform (O*NET-Powered MVP)

Full-stack MVP built with FastAPI, PostgreSQL, and Redis/Celery that uses O*NET occupation and skill data with logistic-regression calibration to surface job-transition paths — originally motivated by healthcare workforce shortages and the question of which adjacent roles clinical and administrative staff can reskill into. Generates Ready Now, Trainable, and Long-Term Reskill recommendations with gap analysis.

FastAPIPostgreSQLSQLAlchemyRedisCeleryscikit-learnLogistic RegressionO*NETPythonPydantic
07

Law School Graduation Rates Analysis

Statistical analysis of law school graduation rates using multivariate regression modeling. Conducted as part of research work at the Center for Patient Partnerships.

Regression AnalysisEducation AnalyticsStataSAS

Live Demo

This model runs entirely in your browser. No data leaves your machine. It demonstrates how Medicare Advantage health plans can estimate their Star Rating from quality measure inputs.

star_rating_predictor.py
75%
55% · 25th · 50th · 75th pctl 95%
3.5
2.5 · 25th · 50th · 75th pctl 4.8
70%
50% · 25th · 50th · 75th pctl 92%
15%
8% · 25th · 50th · 75th pctl 22%
Off

Predicted Rating

3.5
5★
0%
4★
0%
3★
0%
2★
0%
1★
0%

Ordinal Thresholds

≥2
≥3
≥4
≥5
z
1★ region5★ region

What Would Move the Needle?

Adjust sliders to see guidance.

Synthetic coefficients calibrated to CMS 2025 measure weights and MA-PD star distribution. Not trained on contract-level data. Does not include case-mix (CAI) adjustment. Full methodology →

2019Academic Medicine

Broadening medical students' exposure to the range of illness experiences: a pilot experimental curriculum trial

Pandhi N, Gaines ME, Deci D, Schlesinger M, Culp C, Karp Z, Legler C, Grob R

Pilot experimental curriculum trial evaluating approaches to broadening medical students' exposure to the range of illness experiences, with a focus on depression education.

Curriculum EvaluationMedical EducationMixed Methods Research
2019Health Environments Research & Design Journal

Influence of environmental design on team interactions across 3 family medicine clinics: perceptions of communication, efficiency, and privacy

Karp Z, Kamnetz S, Wietfeldt N, Sinsky C, Molfenter T, Pandhi N

Led comprehensive mixed-methods study on how environmental design influences team interactions in family medicine clinics. Secured $18,000 grant, conducted 120 hours of observations, facilitated 9 focus groups with 40 participants.

Mixed Methods ResearchGrant WritingHuman Subjects ProtocolsNVivo
2018International Journal of Healthcare Management

Medicare Shared Savings Programs: higher cost accountable care organizations are more likely to achieve savings

Berkson S, Davis S, Karp Z, Jaffery J, Flood G, Pandhi N

Analysis of cost patterns and savings achievement across Medicare Shared Savings Program ACOs, finding that higher-baseline-cost organizations were more likely to achieve shared savings.

Healthcare EconomicsACO AnalyticsValue-Based CareStataSAS
2016Implementation Science (Proceedings of the 3rd Biennial Conference of the Society for Implementation Research Collaboration)

An efficient process of gathering diverse community opinions to inform an intervention

Pandhi N, Jacobson N, Serrano N, Hernandez A, Zeidler-Schreiter E, Wietfeldt N, Karp Z

Methodological contribution on gathering diverse community opinions to inform health system interventions, presented at a national implementation science conference.

Community EngagementImplementation ScienceQualitative Research
2014Journal of Innovation in Health Informatics

Approaches and challenges to optimizing primary care teams' electronic health record usage

Pandhi N, Yang WL, Karp Z, Young A, Beasley JW, Kraft S, Carayon P

Qualitative study examining how primary care teams use and optimize EHR systems, identifying key barriers and facilitators to effective adoption.

EHR OptimizationQualitative ResearchGrounded TheoryPrimary Care
2012Proceedings of World Conference on E-Learning in Corporate, Government, Healthcare, and Higher Education

Approaches and challenges to optimizing the use of electronic health records in primary care (preliminary findings)

Yang W, Pandhi N, Karp Z, Young A, Beasley J, Kraft S, Carayon P

Conference proceedings presenting preliminary findings on EHR optimization challenges in primary care settings.

EHR OptimizationHealth InformaticsPrimary Care

Presentations

17 presentations across conferences, workshops, and seminars

Since 2018, my methodology work has shifted to long-form technical writing — see my blog for current work on Star Ratings methodology, ordinal regression, and HEDIS implementation.

2017

Primary care patient perceptions of clinic design across three practices

National Collaborative for Improving Primary Care Through Industrial and Systems EngineeringMadison, WI

Identifying effective strategies for scaling an intervention to engage patients in care redesign

UW Institute for Clinical and Translational Research Dissemination & Implementation Short CourseMadison, WI
2016

Developing an effective writing collaborative to rapidly disseminate lessons from system redesign

Association of American Medical Colleges' Integrating Quality MeetingChicago, IL

Primary care team perceptions of team-based care and clinic design types across three practices

National Collaborative for Improving Primary Care Through Industrial and Systems EngineeringMadison, WI

Experience

Lead Data Engineer

Baltimore Health Analytics

Nov 2025 - Present

The person who makes sure the methodology is right before it ships to health plans.

  • Leading data engineering and methodology for a Medicare Advantage analytics platform in Python and SQL, replacing manual processes in a legacy Rails orchestration layer with reproducible, tested pipelines
  • Building HEDIS measure pipelines (SUPD, SPC, PQA adherence) and Star Rating simulation models that turn CMS Technical Notes into production-grade logic — including audit-ready measure validation aligned to NCQA specifications
  • Architecting claims, eligibility, and clinical quality data integration across multiple health plan contracts, normalizing heterogeneous source formats into a unified analytic layer

Healthcare Analytics Manager, Embedded Refills and Care Gaps

Health Catalyst

Aug 2020 - Aug 2025

Owned the data platform for a clinical quality product used by health systems nationwide.

  • Redesigned AWS S3 cloud storage architecture into a medallion (bronze/silver/gold) ELT pattern with Python and dbt, halving cloud storage and compute spend and cutting a multi-day pipeline refresh to same-day delivery
  • Built self-service aggregation systems in SQL and Python within a Rails-based ETL pipeline, replacing a monthly manual reporting cycle with on-demand daily refreshes used by product and customer success teams
  • Developed unified financial tracking dashboards in Tableau and Power BI monitoring $MM+ in revenue and contract performance, giving executive stakeholders real-time visibility into renewal risk
  • Improved production data quality by validating against RxNorm and clinical quality standards across the Redshift-based analytics platform, reducing downstream measure discrepancies flagged in client audits

Healthcare Analytics Manager

healthfinch

Jan 2019 - Jul 2020

Promoted to lead the analytics function and tie it directly to revenue.

  • Led ROI modeling and sales support that contributed to $1MM+ in recurring revenue by translating clinical workflow data into customer-facing value demonstrations
  • Built dashboards that drove 7x growth in internal user adoption and eliminated 400+ hours of manual reporting preparation annually
  • Managed analytics roadmap and cross-functional stakeholder relationships across product, engineering, and customer success teams, aligning data priorities with quarterly business objectives

Healthcare Analytics Specialist

healthfinch

Dec 2017 - Dec 2018

First analytics hire — built the reporting infrastructure from zero.

  • First analytics hire — built HIPAA- and HITRUST-compliant reporting infrastructure from scratch, establishing the data foundation the product ran on
  • Designed reusable SQL scripts deployed across 50+ health systems, standardizing performance benchmarks used in sales and marketing
  • Stood up Sisense BI platform and built the initial dashboard suite, giving sales, product, and customer success teams their first self-service access to product usage data

Assistant Researcher (and earlier roles)

University of Wisconsin-Madison, Department of Family Medicine and Community Health

Sep 2009 - Jun 2018

Nine years across the UW School of Medicine, starting as a research specialist and advancing to lead statistician on published, federally-funded work.

  • Analyzed 50 years of longitudinal survey data on 10,000+ adults in the Wisconsin Longitudinal Study, integrating survey, health, and administrative records across decades of follow-up
  • Led qualitative research on EHR optimization in primary care teams, published in the Journal of Innovation in Health Informatics — an early study of how health systems fail to extract value from clinical data
  • Served as lead statistician on ACO cost research integrating EMR, claims, and satisfaction data, producing cohort analyses used in ACO governance decisions and a peer-reviewed publication
  • Managed IRB compliance, federal reporting, and data security protocols across studies funded by AHRQ, NIH, and PCORI

Principal

Sustainable Clarity

2007 - 2014
  • Managed up to 8 mentored copy editors, graphic designers, and photographers at a time to create print-ready books and dossiers
  • Wrote articles syndicated by national newswires (Thomson Reuters, LexisNexis, New York Times)
  • Edited and indexed client manuscripts published as books, peer-reviewed journals, articles, dissertations, grants, and newsletters

Testimonials

It's rare to come across someone like Zaher — not just for his intelligence, but for the care, curiosity, and sense of responsibility he brings to everything he does. He consistently pushes himself to deliver thoughtful, high-quality work because he genuinely wants to make a difference — for the team, for the client, and for healthcare and patients. Zaher has a natural ability to think deeply about problems, often catching nuances others miss, and he balances that with a strong commitment to execution. He doesn't let go of a challenge until he's found the best path forward — always asking the right questions, weighing the trade-offs, and staying focused on what will have the greatest impact. What I've always appreciated about Zaher is how he balances his focus and drive with a great sense of humor. He's collaborative, approachable, and brings a humor to the work that makes even the tough days a bit easier. His ability to stay grounded and keep things in perspective — while still holding himself to a high standard — makes him a great partner to have on any project. He leads with integrity and consistently aims to do what's right — even when it takes more effort. That kind of mindset is inspiring, and it's something I deeply respect. Any team working to solve meaningful problems in healthcare would benefit from having Zaher in their corner.

Joanna Laucirica, PMP

Director, Customer Operations, Health Catalyst

I had the pleasure of working with Zaher Karp for four years at Health Catalyst, where he served as the Healthcare Analytics Manager for the Embedded Refills product. Zaher was solely responsible for ensuring timely, accurate data delivery across multiple EHRs—Athena, Allscripts, Cerner, and Epic. He successfully led the migration of analytics from our previous platform, Sisense, to Pop Insights, and implemented automated weekly data refreshes for Cerner, significantly improving efficiency and reliability. Despite being the only engineer on the team, Zaher consistently delivered high-quality work. He is intelligent, thorough, and deeply committed to understanding customer needs. He often joined client calls to clarify requests and wasn't afraid to push back when necessary to protect data integrity and long-term scalability. I highly recommend Zaher for any role that requires a capable, dependable, and customer-focused data engineer.

Jessica McCay

Director of Customer Success, Health Catalyst

The phrase "had the privilege to work with" does not even come close to the experience that I had working with Zaher. It became quickly apparent that he is deeply committed, profoundly intelligent, and highly versatile. On top of being a SQL "wizard", he worked tirelessly to optimize existing solutions and provided a great deal of cost savings in the process. Technical qualifications aside, and he has many, one of my favorite things about Zaher was his ability to bring his passion, enthusiasm, and humor to any situation. Zaher and I commonly crossed paths at Health Catalyst's "Open Space" initiative where we would collaborate to foster learning through various knowledge sharing or working sessions. He would provide thought-provoking feedback and worked to help this initiative reach as many people as possible. I wholeheartedly recommend Zaher as someone who would improve any organization. They truly would be lucky to have him! He genuinely works hard to better the lives of those that enter our healthcare system and has indirectly improved the lives of so many patients and providers.

Jake St. Germain

Software Developer, Health Catalyst

I had the pleasure of working with Zaher for almost 5 years at Healthfinch/Health Catalyst. Zaher was the mastermind behind the data dashboards that our customer success team and customers relied on. He provided expert guidance to analysts who needed assistance with their data migration to our system, and ensured any bugs or feature requests were addressed timely. Zaher was also always willing to take time to answer questions and explain the "why" behind our data. We were so fortunate to have his expertise.

Emily Rodriguez

Professional Services Team Lead, Lunit (formerly Health Catalyst)

Education

Master of Public Health (MPH), Biostatistics

University of Wisconsin-Madison

2013 - 2015
  • Health Innovation Program Research Trainee
  • Trained in dissemination & implementation research, qualitative interviewing, and focus group facilitation
  • Designed, wrote grant proposal (awarded $18,000), and published peer-reviewed research on clinic environments

Industrial & Systems Engineering Graduate Certificate, Patient Safety

University of Wisconsin-Madison

2014 - 2015
  • Trained through AHRQ-funded Systems Engineering Initiative for Patient Safety
  • Completed quality improvement projects in medication safety using root cause and job analysis

Bachelor of Arts (BA), English Literature

University of Wisconsin-Madison

2003 - 2007
    psql — zaher_resume_db
    zaher_resume_db=#
     name        | role               | domain                            | tools
    -------------+--------------------+-----------------------------------+----------------------
     Zaher Karp  | Lead Data Engineer | Healthcare Payer / Regulated Data | SQL, Python, dbt, AWS
    (1 row)
    zaher_resume_db=#