LITF-PA-2026-020 · Building Science / Energy Efficiency

System and Method for Non-Invasive Continuous Building Envelope Thermal Performance Assessment Using Smart Thermostat Heating and Cooling Response Curve Analysis with Edge-Deployed Machine Learning

Cross-section of a house showing thermal insulation layers with smart thermostat displaying temperature response curves
⚖️ Prior Art Notice: This document is published as defensive prior art under 35 U.S.C. § 102(a)(1). The inventions described herein are dedicated to the public domain as of the publication date above. This disclosure is intended to prevent the patenting of these concepts by any party.

Abstract

Disclosed is a system and method for continuously assessing the thermal performance of a building envelope without invasive testing, dedicated sensors, or professional energy audits. The system extracts thermal time constants and effective thermal resistance (R-value) estimates from the heating and cooling response curves already recorded by commodity smart thermostats (Nest, Ecobee, Honeywell Home, etc.), cross-references these curves against hyperlocal weather data (outdoor temperature, solar irradiance, wind speed) obtained from public APIs, and applies an edge-deployed Bayesian state-space model to separate envelope performance from HVAC equipment efficiency, occupant behavior, and internal heat gains. By tracking the evolution of inferred thermal parameters over months and years, the system detects insulation degradation, air seal failures, window gasket deterioration, and moisture intrusion events at the room or zone level, generating actionable alerts for homeowners and building managers without requiring a single additional sensor, wire, or site visit.

Field of the Invention

This invention relates to building science and energy efficiency, specifically to non-invasive methods for estimating building envelope thermal performance using time-series data from existing smart home infrastructure combined with edge-deployed machine learning.

Background

Buildings account for 40% of total U.S. energy consumption (EIA, 2024), with space heating and cooling representing roughly half of that figure in residential structures. The building envelope — walls, roof, windows, doors, and foundation — is the primary determinant of how much energy is required to maintain comfortable indoor temperatures. Envelope degradation is pervasive: the DOE Building America program estimates that 90% of U.S. homes built before 2000 have insulation levels below current code requirements, and even code-compliant insulation degrades over time through settling, moisture absorption, pest damage, and thermal bridging from renovation work.

Current methods for assessing envelope thermal performance are expensive, disruptive, and episodic:

Smart thermostats have penetrated roughly 18% of U.S. households as of 2025, with 30-40 million installed units. Every one of these devices continuously records indoor temperature at 5-minute intervals (or finer), HVAC system state (heating, cooling, fan-only, idle), setpoint schedules, and in some cases humidity and occupancy. This data stream is a rich but unexploited signal for envelope performance inference.

The physical basis is straightforward. A building's thermal response to HVAC input follows a first-order (or multi-order) exponential decay characterized by the time constant τ = R × C, where R is the effective thermal resistance of the envelope and C is the thermal capacitance of the interior mass. Bacher and Madsen (Applied Energy, 2011) demonstrated that grey-box models using indoor-outdoor temperature differentials can estimate R and C simultaneously from smart thermostat data with R² > 0.90 under favorable conditions. However, their work required custom instrumentation, manual model specification, and was validated only in controlled test cells.

The gap in the prior art is a complete, production-ready system that: (a) automatically extracts thermal response curves from commodity smart thermostat data requiring zero additional hardware, (b) separates envelope degradation from HVAC efficiency changes and occupant behavior variations through a Bayesian approach, (c) tracks thermal parameters continuously over months and years to detect gradual degradation, and (d) localizes degradation to specific zones in multi-thermostat homes.

Detailed Description

1. Data Acquisition Layer

The system ingests time-series data from smart thermostat APIs (Nest Device Access, Ecobee API, Honeywell Home API, or local protocols such as Matter/Thread). Required signals per thermostat zone, sampled at minimum 5-minute intervals: indoor temperature T_in(t) in °C with ±0.5°C resolution (typical smart thermostat accuracy); HVAC system state s(t) ∈ {heating, cooling, fan_only, idle}; thermostat setpoint T_set(t); and occupancy state o(t) ∈ {home, away, sleep} when available from thermostat occupancy sensors or user schedules.

External weather data is obtained from public APIs: OpenWeatherMap or Open-Meteo (free, no API key required for basic tier). Required signals at 15-minute or finer resolution: outdoor temperature T_out(t), solar irradiance GHI(t) in W/m², wind speed v_wind(t) in m/s, and relative humidity RH_out(t). Location is derived from thermostat zip code or GPS coordinates.

2. Response Curve Extraction

The core signal extraction identifies "natural experiment" windows when the building undergoes thermal transients that reveal envelope properties. Three event types are targeted:

For each identified event, the system extracts a response curve: a time series of (t, T_in, T_out, GHI, v_wind) tuples spanning the transient period. Typical yield: 2-8 usable coast-down events per week during heating or cooling season, depending on climate zone, setpoint schedule, and occupancy patterns.

3. Grey-Box Thermal Parameter Estimation

Each response curve is fitted to a lumped-parameter thermal network model. The baseline model is a second-order (2R2C) circuit: R_env represents the effective thermal resistance of the building envelope (m²·K/W), aggregated across all surfaces; C_bldg represents the thermal capacitance of the building interior (J/K), including furnishings, internal walls, and floor slab; R_inf represents the effective thermal resistance associated with air infiltration, modeled as wind-speed-dependent: R_inf = R_inf0 / (1 + α·v_wind); and Q_solar represents solar heat gain, modeled as Q_solar = A_eff · GHI · τ_window, where A_eff is effective solar aperture area and τ_window is effective window transmittance.

The model state equation is:

C_bldg · dT_in/dt = (T_out - T_in)/R_env + (T_out - T_in)/R_inf + Q_solar + Q_hvac + Q_internal

where Q_hvac is HVAC thermal output (inferred from system state and estimated capacity) and Q_internal accounts for internal heat gains (lighting, appliances, occupants — estimated from occupancy state and time of day using DOE residential prototype building assumptions: 2.2 W/m² average, 4.4 W/m² peak during occupied evening hours per PNNL residential prototypes).

Parameter estimation uses maximum likelihood via the continuous-discrete extended Kalman filter (CD-EKF) as described by Bacher and Madsen (2011), adapted for the specific input signals available from smart thermostats. The filter processes each response curve to produce point estimates and confidence intervals for {R_env, C_bldg, R_inf, A_eff·τ_window}.

4. Bayesian Longitudinal Tracking

Individual response curve estimates are noisy (typical coefficient of variation 15-30% per event, driven by internal gain uncertainty and sensor noise). The system's key innovation is longitudinal aggregation: a hierarchical Bayesian model treats each event's parameter estimate as a noisy observation of slowly-varying true parameters.

The state-space model tracks θ_t = {R_env, C_bldg, R_inf, A_eff·τ} with a random-walk prior: θ_t = θ_{t-1} + ε, where ε ~ N(0, Q). The process noise covariance Q is set to allow parameter drift on the timescale of months (Q_ii ≈ (0.01·θ_i)² per week), reflecting the physical reality that insulation degradation, settling, and moisture accumulation occur gradually. Observation model: each event's parameter estimate y_k ~ N(θ_t, Σ_k), where Σ_k is the per-event estimation uncertainty from the CD-EKF.

Inference uses sequential Monte Carlo (particle filter) with 1,000 particles, resampled via systematic resampling when effective sample size drops below 500. Posterior distributions are stored as Gaussian mixture approximations (3 components) for compact representation. Total storage requirement: ~200 bytes per zone per update.

5. Anomaly Detection and Alerting

The system generates alerts when posterior parameter estimates deviate significantly from the building's established baseline. Two detection modes operate concurrently:

In multi-zone homes (2+ thermostats), the system performs differential diagnosis. If R_env degrades in one zone while remaining stable in others, the alert identifies the specific zone and suggests likely failure modes based on building vintage, zone location (e.g., attic zone → roof insulation, ground-floor zone → foundation/crawl space), and event characteristics.

6. Edge Deployment Architecture

The entire inference pipeline runs locally on a home hub (Raspberry Pi 4, Google Home, Amazon Echo with local processing, or Apple HomePod). Memory footprint: < 50 MB. CPU usage: < 5% average on a Cortex-A72. No thermostat data leaves the home network. Weather API calls use only zip code, revealing no personally identifiable information. The system exposes a local REST API (port 8420) serving JSON with current parameter estimates, trend plots (SVG), and alert status. An optional cloud sync endpoint transmits only anonymized, aggregated thermal parameters (no indoor temperature traces) for fleet-level benchmarking.

7. Calibration and Validation

Initial calibration leverages publicly available building stock data. For homes with known construction year and zip code, the system initializes parameter priors using RECS (Residential Energy Consumption Survey) building characteristic distributions and IECC code-vintage R-value tables. For a 1990s-era wood-frame home in IECC Climate Zone 4, the prior for R_env,wall is N(13.0, 3.0²) ft²·°F·h/Btu, reflecting code-minimum R-13 wall insulation with uncertainty spanning the range of actual field performance documented by Kramer et al. (Energy and Buildings, 2016).

Validation against blower door tests: the system's R_inf estimate can be converted to an equivalent air changes per hour at 50 Pa (ACH50) using the Sherman-Grimsrud model relating infiltration rate to leakage area, stack effect, and wind pressure. This provides a direct comparison with blower door test results when available.

8. Figures Description

Claims

  1. A system for non-invasive assessment of building envelope thermal performance, comprising: a data ingestion module receiving indoor temperature and HVAC system state time series from one or more smart thermostats; a weather data module receiving outdoor temperature, solar irradiance, and wind speed from external APIs; a response curve extractor identifying thermal transient events including HVAC-off coast-down, HVAC recovery, and solar gain events; and a grey-box parameter estimator fitting a lumped-parameter thermal network model to extracted response curves to estimate effective envelope thermal resistance, building thermal capacitance, and infiltration resistance.
  2. The system of claim 1, wherein the grey-box model is a second-order 2R2C thermal network with wind-speed-dependent infiltration resistance and solar gain terms, estimated using a continuous-discrete extended Kalman filter.
  3. The system of claim 1, further comprising a Bayesian longitudinal tracking module that treats individual event parameter estimates as noisy observations of slowly-varying true parameters, using a particle filter with random-walk state dynamics to produce posterior distributions over envelope thermal parameters that improve in precision as more events are observed over weeks and months.
  4. The system of claim 3, further comprising an anomaly detection module that generates alerts when the posterior mean of effective thermal resistance declines at a rate exceeding a configurable threshold over a trailing window, or when a step change exceeding a configurable significance level occurs within a short temporal window.
  5. The system of claim 1, wherein the entire inference pipeline executes locally on a home automation hub or single-board computer, transmitting no indoor temperature data off-premises, with weather API queries using only zip code as location identifier.
  6. A method for detecting building envelope degradation comprising: continuously collecting indoor temperature and HVAC state data from a smart thermostat and outdoor weather data from a public API; identifying thermal transient events in the data stream; fitting a parametric thermal model to each event to estimate envelope thermal resistance and infiltration resistance; aggregating estimates over time using Bayesian state-space modeling; and alerting when estimated parameters exhibit statistically significant degradation trends or acute step changes.
  7. The method of claim 6, further comprising multi-zone differential diagnosis in buildings with two or more smart thermostats, wherein zone-specific parameter changes are compared to isolate degradation to a specific building region and suggest likely failure modes based on zone location and building vintage.
  8. The method of claim 6, wherein initial parameter priors are set using publicly available building stock data including construction vintage, climate zone, and code-era insulation requirements from RECS and IECC databases.
  9. The method of claim 6, further comprising validation against blower door test results by converting the inferred infiltration resistance to equivalent ACH50 using the Sherman-Grimsrud model.
  10. The system of claim 1, wherein HVAC equipment capacity is inferred from the maximum sustained rate of temperature change during steady-state operation, enabling separation of envelope thermal resistance from HVAC efficiency degradation when both change simultaneously.

Prior Art References

  1. EIA — Buildings sector energy consumption — 40% of U.S. energy use
  2. DOE Building America Program — Pre-2000 housing stock insulation deficiency estimates
  3. Fokaides & Kalogirou, Energy and Buildings 2011 — Quantitative infrared thermography for U-value estimation (±15% accuracy)
  4. Desogus et al., Energy and Buildings 2011 — In-situ U-value measurement with heat flux meters per ISO 9869-1
  5. Statista — U.S. smart thermostat market penetration
  6. Bacher & Madsen, Applied Energy 2011 — Grey-box modelling of building thermal behaviour with RC models
  7. PNNL Residential Prototype Building Models — Internal heat gain assumptions
  8. RECS — Residential Energy Consumption Survey — Building stock characteristics
  9. IECC — International Energy Conservation Code — Code-vintage insulation requirements
  10. Kramer et al., Energy and Buildings 2016 — Field performance of residential wall insulation
  11. Sherman & Grimsrud infiltration model — Relating leakage area to infiltration rate
  12. OpenWeatherMap API — Public weather data source
  13. Open-Meteo API — Free, open-source weather API
  14. Google Nest Device Access API — Smart thermostat data access
  15. Ecobee Developer API — Smart thermostat data access
  16. ASTM E779 — Standard test method for determining air leakage rate by fan pressurization

Implementation Notes

A reference implementation targeting Raspberry Pi 4 with Python 3.11, NumPy, and SciPy can execute the full pipeline (curve extraction, CD-EKF fitting, particle filter update) in under 2 seconds per event on a single Cortex-A72 core. The particle filter with 1,000 particles requires approximately 12 MB of working memory. SQLite stores the historical parameter posterior (Gaussian mixture parameters plus raw event metadata) at roughly 1 KB per event, yielding under 500 KB per year per zone.

For fleet deployment across a utility territory, the optional anonymized aggregation layer can identify neighborhoods with accelerating envelope degradation, enabling targeted weatherization program outreach. A utility serving 100,000 smart-thermostat-equipped homes can process fleet-level statistics on a single 4-core VM, since individual inference runs locally on each home hub.