Drug Reviews NLP
Mining 215,063 patient reviews to identify which medications outperform alternatives — and quantifying the business impact of drug non-adherence.
Overview
The Question
Patients frequently switch or stop medications due to dissatisfaction. Which drugs significantly underperform their alternatives, and what's the financial impact of this non-adherence?
The Approach
Combined SQL analysis, NLP sentiment extraction, and topic modeling on 215K patient reviews from Drugs.com (2008-2017) to surface rating gaps, sentiment discrepancies, and recurring side effect themes.
Key Findings
5+ Point Rating Gap
Found systematic performance gaps between best and worst drugs within the same conditions — some patients getting significantly worse care.
8 Underperforming Conditions
Identified conditions with average ratings below 6.0, indicating systematic unmet patient needs across multiple medications.
Birth Control: 18% of Reviews
Highest volume category despite below-average satisfaction — a major opportunity for pharmaceutical improvement.
Sentiment vs Rating Mismatch
VADER analysis revealed reviews where explicit ratings didn't match expressed sentiment — hidden dissatisfaction signals.
Methodology
Technical Highlights
The Dashboard
Built a Streamlit app ("Drug Alternative Finder") that lets users explore the data — compare drugs within conditions, view rating distributions, and surface better-rated alternatives.
Try the Live App ↗