FLSIG Insight

Customs data as a market-research primitive: a primer

Published December 15, 2025

This primer introduces customs and bill-of-lading data as a market-research input for firms whose research budget does not stretch to bespoke syndicated reports. The objective is to give the reader a structural understanding of where this data comes from, what each layer of it represents, and where its limits are. FLSIG analysis suggests that mid-market exporters routinely either over-trust or under-trust customs data, and that both errors stem from the same root: an incomplete model of how the data is produced.

What "customs data" actually refers to

The term is loose, and that looseness is the first source of confusion. In FLSIG's usage, "customs data" refers to records generated by the act of clearing goods across a national border, in either direction. These records exist because virtually every customs administration in the world requires importers and exporters (or their brokers) to file structured declarations against a national tariff schedule, almost always derived from the Harmonized System maintained by the World Customs Organization. The WCO's introduction to the Harmonized System remains the authoritative starting point for understanding the underlying nomenclature.

Researchers at FLSIG distinguish three operationally different layers within the broader customs-data category.

Aggregated trade statistics, published by national statistics offices, the UN Comtrade database, the WTO's data portal, and Eurostat for EU member states. These are typically reported at the HS six- or eight-digit level, by partner country, in monthly or quarterly windows.
Transactional customs records, sourced either through national disclosure regimes (where they exist) or through commercial intermediaries who aggregate, clean, and re-sell them. These records may identify specific importers, exporters, or both, with line-item descriptions, values, weights, and volumes.
Bill-of-lading data, generated by ocean carriers and consolidators and surfaced primarily by the US Customs and Border Protection regime under the AMS rule. This is a parallel data stream to formal customs declarations and is treated separately in the FLSIG brief on bill-of-lading economics.

Each layer has a different cost, latency, and reliability profile. Confusing them is the most common mistake FLSIG observes among practitioners new to the field.

What questions customs data can credibly answer

Used carefully, customs data is well-suited to a defined set of market-research questions:

Has this product category been imported into the target market in commercially relevant volumes over the past 24–36 months?
Which countries are the dominant origin sources, and how concentrated is the supply base?
What is the implied unit price (value divided by quantity) in the target market, and how has it trended?
Which named importers, if any, are receiving comparable goods, and at what cadence?
Are there visible seasonal patterns in arrivals that should inform a launch calendar?

These questions share a property: they are answerable, at least in part, from the structure of the data itself, without requiring inferences that the data was never designed to support.

Where the data fails or misleads

FLSIG's analysis indicates that misuse of customs data tends to fall into recognisable patterns.

First, unit price is not landed cost and not consumer price. The declared value at the border, typically expressed on a CIF or FOB basis depending on jurisdiction, includes neither domestic distribution costs, marketing, retailer margin, nor VAT/sales tax in most cases. A mid-market exporter that reads a customs unit price as a proxy for what the consumer pays will systematically misestimate addressable margin.

Second, HS classification is not uniform across jurisdictions beyond the six-digit level. Two countries may both report imports of a product but classify them under different eight- or ten-digit subheadings, producing volumes that do not reconcile. This is a feature, not a bug, of the system and is one of the reasons HS-code work has become a strategic discipline in its own right. FLSIG addresses this layer separately in HS-code intelligence: from classification to strategic insight.

Third, named-importer data is often incomplete in ways that are not obvious from the records themselves. Some firms route shipments through forwarders or related entities that appear in the data as the importer of record, masking the ultimate buyer. Others use distribution centres in third countries, so that a shipment "to" country X is in fact destined for country Y. The data does not declare these patterns; they must be inferred from the cadence and structure of records.

Fourth, gaps and reporting lags are not random. Smaller economies, particularly those without strong statistical offices, may publish data quarterly or annually rather than monthly, and revisions can be substantial. UNCTAD's recurring statistical compendia, including the UNCTAD Handbook of Statistics, document these reporting asymmetries explicitly and are useful for calibrating expectations.

How to integrate customs data into an entry decision

FLSIG recommends treating customs data as one of three or four orthogonal signals, never as the sole input. The framework that performs well in practice is to use aggregated trade statistics to define a candidate market shortlist, transactional records to triangulate the identity and behaviour of incumbent suppliers and buyers, and the firm's own quotation traffic and digital demand signals to validate that the historical pattern is still live.

The broader case for using customs data as a primary market-research input — rather than a confirmatory afterthought — is treated in FLSIG's review of foreign market entry frameworks. The compliance implications of the HS classifications surfaced through this work, including the documentation discipline they require, are a separate topic and are taken up elsewhere in FLSIG's working papers.

A note on access and cost

Aggregated trade statistics from UN Comtrade, WTO, OECD, and most major national agencies are free at point of use. Transactional records vary widely. Some jurisdictions (notably the United States for maritime imports, and several Latin American economies) make detailed records available either freely or at low cost through statutory disclosure regimes. Others (most of Europe, China, and most of Asia outside specific carve-outs) do not. Commercial aggregators bridge the gap at price points that scale with granularity, recency, and named-party detail. The economics of that aggregator market shape, in practical terms, what the mid-market exporter can afford to see.

The discipline of customs-data analysis is, finally, a discipline of knowing what each row of data represents and what it does not. Practitioners who treat the records as raw signal rather than as the output of a regulated administrative process will continue to draw confident conclusions from records that cannot support them.