// PROJECT CASE STUDY
ANONYMISED UTILISATION SPACE DATA

Data Analyst Apprenticeship EPA

Space Utilisation

An anonymised analytics case study showing how clinical occupancy data can support better room allocation, scheduling, and operational planning.

Role: Space Design Manager Source: Occupeye Sensor Activity Matrix Tools: Python, Power Query, Power BI Focus: Measurement + Optimisation

Key Outcomes

< 85%

Average occupancy stayed below the clinic utilisation benchmark, highlighting clear headroom for better allocation.

35.21%

MAE reduction once public holidays were removed from the forecast input.

39.13%

RMSE reduction after treating holiday noise separately in the model.

12:00-14:00

A repeatable midday lull that could inform session spacing and room release decisions.

Context

This project was carried out to test whether sensor-derived occupancy data could provide credible evidence for improving clinical space use in a large public building setting. The aim was not only to measure utilisation, but to show where practical optimisation opportunities existed for scheduling, allocation, and future planning.

Scope

The analysis focused on a single-site Occupeye export, with core reporting centred on weekday activity between 08:00 and 18:00. Comparisons were made across anonymised areas, service group patterns, time-of-day demand, and forecast behaviour to show both current utilisation and the operational choices the data could support next.

Tools & Techniques

The project combined lightweight tooling with practical operational analysis, keeping the work usable within a secure healthcare setting while still producing decision-ready outputs.

Occupeye System

Primary source for sensor activity exports and occupancy pattern analysis.

Excel Power Query

Used for initial cleaning, reshaping, and fast preparation of large CSV exports.

Python

Used for aggregation, statistical analysis, and forecast testing across the cleaned dataset.

Power BI

Used to package findings into clearer visuals and portfolio-ready reporting outputs.

Spatial Utilisation Analysis

Compared demand across buildings, departments, and room groupings to find underused capacity.

Timetable and Occupancy Modelling

Reviewed weekday and hourly demand patterns to understand where operational pressure concentrated.

SARIMA Forecasting

Tested future usage patterns and validated model performance with MAE and RMSE.

Stakeholder Requirement Analysis

Framed the outputs around scheduling, space pressure, and next-step data requirements.

Data Cleaning and Anonymisation

Removed incomplete records where needed and presented results in a portfolio-safe format.

Visuals

Anonymised aggregate

Weekly Timetable Density Map

Rebuilt from the project export using weekday average activity by hour. Darker cells indicate heavier observed demand and make peak periods easier to spot at a glance.

08 09 10 11 12 13 14 15 16 17 Mon Tue Wed Thu Fri

Average Utilisation by Area

Average utilisation by area highlights where available clinical capacity may be underused and where redistribution could be tested first.

Area A
63.5%
Area B
58.7%
Area C
53.7%
Area D
53.4%

Daily Activity Profile

The daily profile shows where demand is genuinely concentrated and where recurring dips create room for rescheduling or better release of spare capacity.

08 10 12 14 16 18

Weekday Utilisation by Area

Weekday variation is relatively steady overall, but it still helps show where demand is lighter or heavier across the week when balancing allocations.

Mon Tue Wed Thu Fri Area A Area B Area C Area D

Forecast Accuracy After Calendar Cleaning

Removing public holidays reduced forecast error materially, which matters if the model is to support planning rather than only describe historic usage.

MAE RMSE 19.37 12.55 32.81 19.97 Purple: baseline Green: holidays removed

Variation by Service Group

Differences between anonymised groups help narrow where deeper booking, room-type, or specialist-space review would add the most value.

A B C D E F

Constraints

Some records lacked room identifiers, room type, department, or area fields, which limited room-level matching and reduced the depth of certain breakdowns. The result was still strong enough for high-level utilisation measurement, while also making clear where cleaner metadata and linked systems would unlock sharper optimisation work.

Method

  • Exported the Sensor Activity Matrix from Occupeye across an extended date range.
  • Focused the core analysis on one anonymised site, using weekdays between 08:00 and 18:00.
  • Prepared and reshaped the data in Excel Power Query before further analysis in Python.
  • Compared activity by area, service group, day of week, and time of day.
  • Tested SARIMA forecasts with and without public holidays using MAE and RMSE.

Findings

  • Measured occupancy remained below the expected 85% benchmark across all reviewed areas, confirming that underuse was broad rather than isolated.
  • One lower-utilisation area stood out as the clearest short-term candidate for redistribution or additional scheduling pressure.
  • Weekday demand followed a stable operating rhythm, with stronger late-morning activity and lighter demand at the edges of the day.
  • The lunchtime lull suggested that utilisation is shaped by working patterns as well as booked demand.
  • Removing public holidays improved forecast accuracy materially, showing that calendar effects needed to be handled explicitly before using the model for planning.
  • The analysis was strong enough to justify further integration with booking, attendance, and room reference data.

Privacy and Anonymisation

Organisation-specific names, room identifiers, and sensitive operational detail have been anonymised or selectively generalised for this portfolio version. The analytical structure, method, and direction of the findings remain representative, which allows the case study to demonstrate approach and decision value without exposing confidential information.

Potential Optimisation Opportunities

The analysis did more than measure underuse. It pointed to practical areas where allocation, scheduling, and planning decisions could be improved once supporting operational data is linked in.

Balance Demand Across the Week

Weekday demand was not identical. Lighter periods at the edges of the week suggest some activity could be redistributed without increasing estate pressure.

Smooth Peak Demand

Late-morning pressure was more consistent than early morning or late afternoon use, which creates scope to spread suitable sessions more evenly.

Match Room Size to Session Size

Low utilisation does not always mean low demand. It can also indicate sessions being placed in rooms with more capacity than required.

Use Specialist Rooms More Deliberately

Where higher-value treatment or specialist spaces are underused, tighter allocation rules or shared access could reduce idle capacity.

Reduce Underused Capacity

Lower-use zones provide a practical pilot area for reallocation, consolidation, or overflow use before new space is commissioned.

Recommendations

  • Link Occupeye data with clinic schedules, booking records, attendance, and Micad room reference data so planned and observed use can be compared properly.
  • Start with a pilot in the lowest-utilisation area to test reallocation or shared-use changes before widening the approach.
  • Improve room identifiers, sensor coverage, and metadata standards so future analysis can assess room type, room size, and departmental allocation more reliably.
  • Develop recurring Power BI reporting so senior stakeholders can track utilisation, anomalies, and actions over time.
  • Extend forecasting for seasonal planning once holidays, closures, and other operational constraints are handled consistently.