Data Downloads
All 72,165 source citations and 462 program elements are available as Parquet exports. Datasets marked “cited” include row-level citation linkage; others are citation-tier pending (see methodology).
Schemas & data dictionary: every Parquet file embeds its column schema (readable via DuckDB DESCRIBE), and the data explorer lists each dataset with row counts and column descriptions.
Bundle built: . Files are in Apache Parquet format, readable with DuckDB, pandas, R arrow/duckdb, or any Parquet-compatible tool. Each file includes the same provenance metadata that backs on-screen figures.
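As a minimal sketch of reading an embedded schema, the Python snippet below builds a DuckDB DESCRIBE statement for one export and runs it only when the third-party `duckdb` package is installed. The helper names `describe_query` and `describe_parquet` are illustrative assumptions, not part of the bundle, and the file is assumed to have been downloaded locally.

```python
def describe_query(path: str) -> str:
    """Build a DuckDB statement that lists a Parquet file's columns and types."""
    escaped = path.replace("'", "''")  # escape single quotes for the SQL literal
    return f"DESCRIBE SELECT * FROM read_parquet('{escaped}')"


def describe_parquet(path: str):
    """Run the DESCRIBE query; requires the third-party `duckdb` package."""
    import duckdb  # imported lazily so the query builder works without it

    # Each result row starts with (column_name, column_type, ...).
    return duckdb.sql(describe_query(path)).fetchall()
```

Calling `describe_parquet("jbook_details.parquet")` on a downloaded copy should return one row per column; the same statement can be pasted into the DuckDB shell directly.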
jbook_details.parquet: Project-level cost figures from J-book XML (R-2/P-40 exhibits), with page-level PDF citation.
budget_lines.parquet: Budget line items from R-1 and P-1 Excel rollups (workbook-cited).
fct_budget_trajectory.parquet: FY2024–FY2026 budget trajectory per program element.
fct_influence.parquet: LDA lobbying filings by family key and filing year (income, expense, totals).
fct_program_lobbying.parquet: Program mentions extracted from LDA filing issue text.
fct_budget_to_awards.parquet: Budget-to-contract crosswalk (confidence-tiered: high/medium/low), with per-link derived citations.
dim_geography.parquet: Congressional-district obligation aggregates from USAspending place-of-performance data.
fct_state_per_capita.parquet: State-level per-capita spending with Census population data.
fct_program_concentration.parquet: HHI contractor concentration scores per program element.
fct_improper_exposure.parquet: Agency-level improper-payment exposure estimates (derived).
dim_lobbyists.parquet: Named lobbyists from LDA filings with revolving-door flags and disclosing-filing provenance columns.
jbook_narratives.parquet: Mission/accomplishment narratives from J-book exhibits.
citations.parquet: All source citations (jbook_pdf + workbook + lda_filing), keyed by fact_id.
Additional assets (not in table above)
citations.parquet: all source citations linking fact_ids to PDF pages, workbook cells, and LDA filing UUIDs
pdfs/: 34 SHA-named J-book PDFs (~149 MB total)
workbooks/: 3 R-1/P-1 Excel rollup files
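Because the PDFs are SHA-named, a download can in principle be checked against its own file name. The sketch below is one way to do that, under two assumptions this page does not confirm: that the name is a hex digest of the file contents (possibly truncated), and that the algorithm is SHA-256. The helper names `file_digest` and `mismatched_pdfs` are hypothetical; pass a different `algorithm` if the bundle uses another SHA variant.

```python
import hashlib
from pathlib import Path


def file_digest(path: Path, algorithm: str = "sha256") -> str:
    """Hex digest of a file, read in chunks to keep memory use low."""
    digest = hashlib.new(algorithm)
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()


def mismatched_pdfs(directory: Path, algorithm: str = "sha256") -> list[str]:
    """Names of PDFs whose stem is not a prefix of their content digest.

    Assumption: each file is named <hex digest>.pdf, possibly truncated.
    """
    bad = []
    for pdf in sorted(directory.glob("*.pdf")):
        if not file_digest(pdf, algorithm).startswith(pdf.stem.lower()):
            bad.append(pdf.name)
    return bad
```

An empty list from `mismatched_pdfs(Path("pdfs"))` would suggest every file matches its name under the assumed scheme; a list containing every file more likely means the naming assumption is wrong than that the downloads are corrupt.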