The Houston CBAS presentation contains 61 distinct metrics, charts, and tables. Here’s what RealHouse can replicate today, what we could reach with investment in paid data and local government requests, and what remains truly proprietary.
Of the 61 metrics in the Houston CBAS presentation (Land Tejas Private Consult, Feb 2026), we categorized each by whether RealHouse can replicate it using publicly available data:
The Houston presentation is heavily weighted toward proprietary metrics (community rankings, lot pipeline, builder price catalogs, corridor deep-dives) that CBAS produces through field verification and specialized data collection. The federal data we can automate provides strong macro context but doesn’t cover the new-home operational layer without local lifecycle data.
These 8 metrics can be fully replicated for any U.S. metro using standardized federal data sources. RealHouse already has connectors built for all of them.
| Houston CBAS Metric | RealHouse Source | Cadence | Status |
|---|---|---|---|
| Metro population by year | Census PEP metro estimates | Annual | Built |
| MSA population growth rankings | Census PEP metro estimates | Annual | Built |
| County population growth rankings | Census PEP county estimates | Annual | Built |
| Components of change (births, deaths, migration) | Census PEP components of change | Annual | Built |
| Employment growth trend (top MSAs) | BLS LAUS (place-of-residence) | Monthly | Built |
| YoY employment growth (Houston) | BLS LAUS metro series | Monthly | Built |
| Total nonfarm payroll employment | BLS CES metro series | Monthly | Built |
| Mortgage rates trend | FRED (Freddie Mac PMMS 30-yr) | Weekly | Built |
packages/ingest/src/connectors/census_pop.py
packages/ingest/src/connectors/bls.py
packages/ingest/src/connectors/fred.py
These 6 metrics can be partially replicated, but the public data source has a meaningful gap compared to what CBAS shows.
| Houston CBAS Metric | RealHouse Source | What’s Missing |
|---|---|---|
| Employment by sector (table) | BLS CES industry series | Some Houston-specific sector detail (medical/energy) may require Texas Workforce Commission data, which has inconsistent availability. |
| Annual employment growth by sector | BLS CES industry series | Same as above — sector taxonomy may not perfectly match CBAS’s presentation groupings. |
| Annual new-home closings (financed) | HMDA purchase originations | Only covers mortgage-financed purchases. Misses all-cash transactions (significant in new-home market). Annual cadence, not quarterly. |
| Quarterly closings KPI | HMDA purchase originations | HMDA is annual bulk release, not quarterly. Would need recorder deed data for quarterly granularity. |
| Consumer confidence | FRED alternative sentiment series | Conference Board index requires paid subscription. Public alternatives (U. Michigan consumer sentiment via FRED) exist but are different measures. |
| SF permits vs estimated annual starts | Census BPS (permits only) | We have the permits bar chart. The “estimated starts” overlay line is what CBAS adds — that requires either local lifecycle data or a model. |
These 5 metrics are the core of the “new-home operational layer” — they track the construction pipeline from start through completion and sale. No federal source publishes these at the metro level. To produce them, you need actual permit inspection events from local jurisdictions (foundation inspection = start; certificate of occupancy = completion).
Federal data gives us permits (how many were authorized) but not starts (how many actually broke ground), under construction (how many are in progress), or completions (how many finished). The Census Survey of Construction (SOC) publishes these concepts nationally and regionally, but not at the metro level. CBAS fills this gap with field-verified lifecycle tracking.
| Houston CBAS Metric | What We’d Need | Current Status |
|---|---|---|
| Annual / quarterly new-home starts | Foundation/slab inspection dates from local permit portals | TPIA filed |
| Homes under construction (stock) | Start dates + completion dates to compute stock-flow: UC = started − completed | TPIA filed |
| Completions (quarterly) | Certificate of occupancy (CO) or final inspection dates from local permit portals | TPIA filed |
| Finished vacant inventory | Completions (CO dates) + sales signal (recorder deeds or HMDA) to compute: completed but not yet sold | Phase 3 |
| Finished vacant months’ supply | Derived: finished vacant inventory ÷ trailing monthly closings | Phase 3 |
For Houston, we’ve filed a TPIA (Texas Public Information Act) request with the Houston Permitting Center for bulk historical permit and inspection data. If fulfilled, this gives us foundation inspection dates (starts) and CO dates (completions) for the City of Houston jurisdiction. Harris County ePermits covers unincorporated areas.
For other metros, the equivalent approach is to find open-data portals (ArcGIS, Socrata) or file equivalent public records requests in each jurisdiction.
The remaining 42 metrics (69% of the presentation) have no standardized public data source. These are the metrics that make CBAS’s product proprietary — they require field research, MLS membership, builder relationships, or specialized lot-pipeline tracking that cannot be automated from government data.
| CBAS Metric | Why No Public Equivalent |
|---|---|
| MLS resale snapshot (sales, prices, DOM, active listings, sale/list ratio) | Requires MLS membership data feed (HAR in Houston). Not government data. No public API. |
| Resale closings vs active inventory time series | Same — MLS proprietary data. |
Would need: MLS data-sharing agreement or RETS/RESO Web API access from the local REALTOR association.
| CBAS Metric | Why No Public Equivalent |
|---|---|
| Vacant developed lots (VDL) count | No federal equivalent. Requires parcel-level lot status classification from local plat records and field verification. |
| VDL months’ supply | Derived from VDL + starts — both unavailable. |
| Future lots count | Requires tracking subdivision plat applications, engineering permits, and development pipeline status. Not standardized. |
| Future lots with active site work | Requires classifying infrastructure permits (utilities, paving) or field/remote-sensing verification of site activity. |
Would need: Local plat/subdivision records + parcel GIS + engineering permit data + field verification. This is a core CBAS differentiator.
| CBAS Metric | Why No Public Equivalent |
|---|---|
| Median new-home price + price per SF over time | Requires MLS new-construction data or builder price sheets. FHFA HPI is an index, not a median price. |
| Floorplan QoQ price direction stats | CBAS analyzes 8,000+ floorplans from builder catalogs. No public source for base prices. |
| Starts/closings by base price band | Requires linking each unit to a floorplan base price — proprietary catalog data. |
| Finished vacant by base price band | Same — needs price catalog + inventory linkage. |
| Cheapest floorplan profile | Requires maintained builder floorplan catalog with current base prices. |
| Most expensive floorplan profiles | Same as above. |
Would need: Builder website scraping for floorplan catalogs + MLS new-construction listings.
The Houston presentation includes detailed corridor deep-dives (288 South, Grand Magnolia, River Ranch, Lago Mar East) with the following metric template repeated per corridor:
All of these require CBAS-defined submarket/corridor polygon boundaries, a community database with normalized subdivision identifiers, full lifecycle tracking (starts/closings at the address level), parcel-level lot sizing, and builder attribution.
Would need: Everything in the “needs local data” tier above, plus: submarket polygon definitions, community/subdivision reference table, parcel GIS with lot frontage, and builder name normalization. This is the full CBAS platform.
| CBAS Metric | Why No Public Equivalent |
|---|---|
| 2026 employment growth forecast by sector | Sourced from Greater Houston Partnership. Not a federal series. Would need to manually curate forecast publications. |
| Houston SF starts forecast range (2025, 2026) | CBAS’s own proprietary forecast model output. No public equivalent. |
| Community-level quarterly metrics tables | Requires the full CBAS community database and lifecycle tracking — not a single data source but an entire system. |
The 47 metrics in tiers 3 and 4 aren’t all equally out of reach. Some just need a paid license. Others need time and coordination with local government. Only a subset truly requires boots on the ground. Here’s a data-acquisition-by-data-acquisition breakdown of how close we could get if investment were available.
License a data feed from the local REALTOR association (HAR in Houston) via the RESO Web API standard. Alternatively, use a national aggregator like CoreLogic or ATTOM.
Unlocks:
Residual gap: MLS new-construction tagging varies by market. List price ≠ base price (CBAS tracks base prices before incentives).
Obtain bulk permit + inspection event data from local jurisdictions via TPIA/FOIA requests or negotiate API access to platforms like Accela, Tyler, or OpenGov. For many metros, open-data portals (ArcGIS, Socrata) provide this data for free.
Unlocks:
Residual gap: Inspection event definitions vary by jurisdiction (Houston uses “Foundation”; another metro may use “Footing” or “Slab”). Normalization required per metro.
Commercial property data providers like ATTOM Data and CoreLogic maintain national deed/transaction databases that consolidate county recorder data. Alternatively, file TPIA requests directly with county clerk offices for bulk deed exports.
Unlocks:
Residual gap: Texas does not mandate sale price disclosure on deeds, so Houston recorder data often has blank price fields. Must cross-reference with HMDA or MLS for price. Some cash transfers lack clear “new construction” flags.
Most county assessor offices publish parcel GIS data with lot boundaries, frontage/area dimensions, land use codes, and ownership. HCAD (Harris County) offers quarterly GIS downloads. This is nearly always free or very cheap.
Unlocks:
Residual gap: ~5% of parcels have missing/incomplete frontage fields in some counties. Quality varies.
Monthly satellite imagery (3m resolution from Planet Labs, or higher from Maxar/Nearmap) enables detection of ground disturbance, site clearing, and construction activity through temporal change analysis. This is the closest substitute for CBAS’s field verification of lot pipeline status.
Unlocks:
Residual gap: Satellite can detect visible surface changes but misses underground utilities, ownership intent, liens/encumbrances, and lot “readiness to build” status. False positives from non-residential earthwork. Monthly cadence means 2–4 week lag.
Maintain a daily/weekly scrape of builder websites for floorplan catalogs (base prices, specs, community assignments). Supplement with direct data feeds negotiated with major public builders (Lennar, Pulte, Toll Brothers, etc.). The top 10–20 builders typically cover ~60% of metro production volume.
Unlocks:
Residual gap: Base price ≠ list price. CBAS distinguishes genuine price cuts from incentive changes — scraped data conflates them (~20–30% of observed “price changes” are actually incentive shifts). Smaller builders often don’t list base prices publicly. Plan ID normalization when builders rename/retire plans is manual.
Define communities from plat polygons + permit text field parsing + builder roster curation. Define submarkets/corridors as ZIP code clusters, county subareas, or planning-department market areas. Validate against 2+ years of permit/sales data. This is labor, not a data license — someone has to curate and maintain the reference tables.
Unlocks:
Residual gap: CBAS submarket/corridor boundaries are proprietary and informed by market knowledge (traffic patterns, school districts, builder focus areas). Our ZIP-based or plat-based approximations will differ. Community names are marketing constructs that don’t always match legal subdivision boundaries. Builder attribution from permit applicant fields is ~60–70% accurate (GC ≠ builder).
Contract 2–4 field specialists per metro to sample 20–30% of VDL inventory quarterly and validate lot status, site-work conditions, and community boundaries. This is what CBAS does — “visually verified” is their core differentiator.
Unlocks:
What remains: Even with field sampling, ~10–15% of lot inventory status changes between sampling visits. CBAS likely has higher cadence and deeper builder relationships built over years.
Total estimated cost to reach ~80–85% CBAS parity for a single metro (Houston), organized by phase:
| Data Acquisition | One-Time | Annual | Metrics Unlocked | Parity |
|---|---|---|---|---|
| Federal spine (existing) | — | — | 14 (8 full + 6 partial) | Done |
| Permit lifecycle (TPIA/API) | $15–30k | $5–15k | ~10 (pipeline + geo slicing) | ~95% |
| Parcel GIS (assessor download) | Free–$5k | — | ~4 (lot-size segmentation) | ~95% |
| Transaction data (ATTOM/recorder) | $5–10k | $20–40k | ~3 (total closings + quarterly) | ~90% |
| MLS license (HAR / RESO) | — | $8–15k | ~3 (resale + new-home pricing) | ~100% |
| Builder scraping + feeds | $10–20k | $10–30k | ~5 (pricing, floorplan catalog) | ~85% |
| Satellite imagery (Planet/Maxar) | — | $15–50k | ~4 (VDL, site work detection) | ~75% |
| Community/submarket curation | $10–25k | $5–15k | ~12 (corridors, rankings, maps) | ~70% |
| Field research (sampling) | — | $30–100k | ~6 (VDL validation, conditions) | ~80% |
| Total (first metro) | $40–90k | $93–265k | ~47 additional metrics | ~80–85% |
Subsequent metros cost ~60–70% of the first (reuse connector code, scraping infrastructure, and satellite processing pipelines; marginal cost is mainly per-metro TPIA/API negotiation and community curation labor).
Even with maximum investment, some aspects of the CBAS product remain genuinely difficult to replicate:
The critical insight: permit lifecycle data is the single biggest unlock. For ~$20–35k one-time investment (TPIA + parcel GIS), we go from 14 metrics to 28 — covering the full construction pipeline (starts, UC, completions, finished vacant) that federal data alone cannot provide.
Adding paid data sources (MLS, ATTOM, satellite, builder scraping) pushes coverage to ~43 metrics at ~80–85% CBAS parity. The remaining ~18 metrics require community curation and field validation — the work that makes CBAS a specialized product.
Bottom line: With ~$40–90k one-time and ~$93–265k/year, a single metro can reach 80–85% of what CBAS offers. The last 15–20% is the hardest and most expensive — submarket boundary precision, builder relationships, lot-level field verification, and years of historical calibration.