Fiscal Receipts

Data Downloads

All 72,165 source citations and 462 program elements are available as Parquet exports. Datasets marked “cited” include row-level citation linkage; others are citation-tier pending (see methodology).

Schemas & data dictionary: every Parquet file embeds its column schema (readable via DuckDB DESCRIBE), and the data explorer lists each dataset with row counts and column descriptions.
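
As a minimal sketch of reading an embedded schema, the snippet below runs DuckDB's DESCRIBE over one Parquet file from Python. It assumes the third-party duckdb package is installed and that dim_programs.parquet has already been downloaded to the working directory. The helper names describe_sql and describe_parquet are illustrative and not part of this site.

```python
def describe_sql(parquet_path: str) -> str:
    """Build a DuckDB DESCRIBE statement for one Parquet file."""
    # Escape single quotes so the path is a valid SQL string literal.
    escaped = parquet_path.replace("'", "''")
    return f"DESCRIBE SELECT * FROM read_parquet('{escaped}')"


def describe_parquet(parquet_path: str) -> list[tuple[str, str]]:
    """Return (column_name, column_type) pairs from the file's embedded schema."""
    import duckdb  # third-party dependency (pip install duckdb), imported lazily

    con = duckdb.connect()  # in-memory database; nothing is written to disk
    try:
        rows = con.execute(describe_sql(parquet_path)).fetchall()
    finally:
        con.close()
    # DESCRIBE returns column_name and column_type as its first two columns.
    return [(row[0], row[1]) for row in rows]


# Example (assumes the file was downloaded locally):
# for name, dtype in describe_parquet("dim_programs.parquet"):
#     print(f"{name}: {dtype}")
```

The same DESCRIBE statement can be pasted directly into the DuckDB command-line shell if Python is not needed.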

Bundle built: . Files are in Apache Parquet format and can be read with DuckDB, pandas, R (arrow or duckdb), or any other Parquet-compatible tool. Each file includes the same provenance metadata that backs the on-screen figures.

dim_programs (cited)

326 DoD R&D and procurement program elements with metadata.

dim_programs.parquet

jbook_details (cited)

Project-level cost figures from J-book XML (R-2/P-40 exhibits), with page-level PDF citation.

jbook_details.parquet

budget_lines (cited)

Budget line items from R-1 and P-1 Excel rollups (workbook-cited).

budget_lines.parquet

fct_budget_trajectory (cited)

FY2024–FY2026 budget trajectory per program element.

fct_budget_trajectory.parquet

dim_entities (cited)

Top-200 contractor families by total federal obligation.

dim_entities.parquet

fct_influence (cited)

LDA lobbying filings by family key and filing year: income, expense, totals.

fct_influence.parquet

fct_program_lobbying (cited)

Program mentions extracted from LDA filing issue text.

fct_program_lobbying.parquet

fct_budget_to_awards (cited)

Budget-to-contract crosswalk (confidence-tiered: high/medium/low), with per-link derived citations.

fct_budget_to_awards.parquet

dim_geography (cited)

Congressional-district obligation aggregates from USAspending place-of-performance data.

dim_geography.parquet

fct_state_per_capita (cited)

State-level per-capita spending with Census population data.

fct_state_per_capita.parquet

fct_program_concentration (cited)

HHI contractor concentration scores per program element.

fct_program_concentration.parquet

fct_improper_exposure (cited)

Agency-level improper-payment exposure estimates (derived).

fct_improper_exposure.parquet

dim_lobbyists (cited)

Named lobbyists from LDA filings with revolving-door flags and disclosing-filing provenance columns.

dim_lobbyists.parquet

jbook_narratives (cited)

Mission/accomplishment narratives from J-book exhibits.

jbook_narratives.parquet

citations (cited)

All source citations (jbook_pdf + workbook + lda_filing), keyed by fact_id.

citations.parquet
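
For readers unfamiliar with the HHI scores in fct_program_concentration, the index is conventionally the sum of squared contractor shares. The sketch below computes it on the 0 to 10,000 scale (shares expressed in percent). Whether the published scores use this scale or the 0 to 1 scale is not stated here, so treat the scale as an assumption. The function name and the input mapping of contractor to obligated dollars are hypothetical.

```python
def hhi(obligations_by_contractor: dict[str, float]) -> float:
    """Herfindahl-Hirschman Index on the 0-10,000 scale (shares in percent).

    Assumes one non-negative obligation total per contractor for a single
    program element; this input shape is illustrative, not the file's schema.
    """
    values = list(obligations_by_contractor.values())
    if any(v < 0 for v in values):
        raise ValueError("obligations must be non-negative for a share-based HHI")
    total = sum(values)
    if total <= 0:
        raise ValueError("total obligations must be positive")
    # Square each contractor's percentage share and add them up.
    return sum((100.0 * v / total) ** 2 for v in values)
```

Under this convention a program element with a single contractor scores 10,000, and one split evenly between two contractors scores 5,000.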

Additional assets (not in table above)

  • citations.parquet: all source citations linking fact_ids to PDF pages, workbook cells, and LDA filing UUIDs
  • pdfs/: 34 SHA-named J-book PDFs (~149 MB total)
  • workbooks/: 3 R-1/P-1 Excel rollup files
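
The fact_id linkage described above can be sketched in plain Python as a lookup from each fact to its citations. This is an illustration under assumptions: it presumes that rows in a cited dataset carry a fact_id column matching the key in citations.parquet, and the function names are hypothetical. Only fact_id is taken from this page; every other column is passed through untouched.

```python
from collections import defaultdict


def index_citations(citation_rows):
    """Group citation rows by fact_id so each fact can look up its sources."""
    by_fact = defaultdict(list)
    for row in citation_rows:
        by_fact[row["fact_id"]].append(row)
    return dict(by_fact)


def attach_citations(fact_rows, citation_rows):
    """Return copies of fact_rows with a 'citations' list added to each one.

    Facts without any matching citation get an empty list rather than
    being dropped, so gaps in coverage stay visible.
    """
    by_fact = index_citations(citation_rows)
    return [
        {**fact, "citations": by_fact.get(fact["fact_id"], [])}
        for fact in fact_rows
    ]
```

Rows in this shape can be produced from a downloaded file with pandas, for example pandas.read_parquet("citations.parquet").to_dict("records"), assuming pandas and a Parquet engine such as pyarrow are installed.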