1. Data Preparation
Four datasets were analyzed sequentially to build a complete picture of NHS GP activity:
- actual_duration.csv: Appointment length data (7 months).
- national_categories.csv: National classification breakdowns (11 months).
- appointments_regional.csv: The core dataset — 742,529,759 appointment records over 30 months.
- tweets.csv: Public sentiment data scraped from Twitter for qualitative signal.
Dates were cast to datetime for temporal analysis. Over 21,600 duplicate rows were removed from the regional dataset alone. Rows flagged as Unknown / Data Quality or Unmapped were intentionally retained — their patterned occurrence was itself informative, and discarding them wholesale would have introduced bias.
2. Staffing & Capacity
The headline finding sounds reassuring: on average, the NHS stayed well below its estimated single-day threshold of 1,200,000 appointments. But averages hide the story.
When the top three busiest ICBs are isolated, NHS North West London alone approached the 1,200,000 threshold on its own — a figure the national average completely obscures. Conversely, the bottom three ICBs never exceeded 60,000 appointments in any single month, even during the peak COVID-19 period, dragging the mean downward.
3. The Booking Interval Effect
The single most actionable finding in the dataset: the further in advance an appointment is booked, the less likely a patient is to show up. Same-day bookings have a 95.8% attendance rate. This drops progressively, falling below 85% at 15–21 days out and reaching just 72.5% for appointments booked more than 28 days in advance.
4. Appointment Duration & Delivery
The duration data reveals a clear operational reality: GP appointments are overwhelmingly short.
- 52.1% of all documented appointments lasted 15 minutes or less.
- A further 24.0% had unknown or data-quality-flagged duration, making definitive conclusions difficult.
- Only 23.9% fell into the 16-minutes-to-1-hour bracket.
This distribution has a direct implication: the majority of GP consultations are short enough to be viably conducted over telephone or video. The NHS already demonstrated this during COVID-19 — the pandemic served as an unplanned proof-of-concept for remote delivery of short-format appointments.
5. Patterns & Recommendations
Reduce Long Booking Horizons
This is the highest-priority operational fix. The data shows a consistent, steep decline in attendance for appointments booked more than two weeks in advance. Reducing advance booking windows or implementing automated reminders at the 14-day and 7-day marks would directly reduce the 72.5% attendance rate seen at 28+ days.
Accelerate Remote Appointment Adoption
Given that over half of all appointments are 15 minutes or less, telephone and video consultations can handle the majority of the NHS's volume. This reduces friction for patients (no travel, easier to cancel if unnecessary) and cuts the cost of a missed slot for the NHS.
Address Localized Strain, Not Just Averages
National-level averages mask the fact that North West London ICB operates close to system capacity on its own. Resource allocation decisions made on the basis of national averages will systematically under-serve high-demand areas. Regional granularity is essential for staffing planning.
Mine Twitter as a Cost-Free Signal
The Twitter dataset offers a low-cost window into patient sentiment about booking experiences, GP quality, and service accessibility. Sentiment analysis of this stream could surface emerging issues before they manifest in attendance data — and provides a channel for the NHS to communicate new appointment formats to the public.
Full Python notebook and statistical models available upon request.
The data tells a pragmatic story: the NHS is not failing at capacity — it is failing at timing. Patients who book far in advance are the most likely to miss, not the most likely to be unwell. Pairing shorter booking windows with scalable remote delivery for routine consultations could recover millions of appointment-hours annually, without placing additional financial burden on patients or clinical staff.