When Levi says "database", "big query", "bigquery", "cloud", "gcloud", "bq", "github", "search the repos", "case law", "what case supports", or asks any question about the case evidence or legal research -- use this skill. Never fall back to searching local files as a substitute. The data is in BigQuery (15,554 docs), GitHub (5+ repos), and a local case law analysis database (13 Oregon cases + AG handbook).
This skill has four complementary tools. Pick the right one based on what Levi is asking:
For questions about the case, people, events, documents, or evidence. This calls the Vertex AI search agent, which has indexed 15,554 documents and returns an AI-generated summary with citations.
When to use: The user asks a question in plain English about the case. They want an answer, not raw data.
python "C:\Users\Big Levi\.claude\skills\gcloud-bigquery\scripts\evidence_search.py" "your question here"
python "C:\Users\Big Levi\.claude\skills\gcloud-bigquery\scripts\evidence_search.py" "question" 10 # more results
python "C:\Users\Big Levi\.claude\skills\gcloud-bigquery\scripts\evidence_search.py" "question" 5 --json # JSON output
Examples of when to use evidence search: "What do we have on Emily Cooper?", "What medications was Russell given?", "What evidence supports the visitation claim?"
The evidence search returns an AI-generated summary with citations to the source documents.
After running the search, present the AI summary to Levi. If he wants to dig deeper into a specific source document, you can use the SQL mode to pull the full text.
For exact counts, specific field lookups, data manipulation, or when Levi explicitly asks for SQL. This hits BigQuery directly with SQL.
When to use: The user wants a count, wants to filter by specific fields, needs to update data, or asks something structural about the database itself.
python "C:\Users\Big Levi\.claude\skills\gcloud-bigquery\scripts\bq_query.py" "SELECT COUNT(*) FROM \`valorinvestigates.valor_investigations.bingaman_documents\`"
Examples of when to use SQL: "How many documents mention Cooper?", "List all unique drugs_mentioned values", "Show me the table schema".
For finding content in the ValorInvestigator GitHub repos -- legal research, case law analysis, memos, and synthesized documents that live in the repos rather than BigQuery.
When to use: The user asks about case law, legal research, memos, or previously written analysis. Also useful for finding what research already exists before doing new work.
python "C:\Users\Big Levi\.claude\skills\gcloud-bigquery\scripts\github_search.py" "search terms"
python "C:\Users\Big Levi\.claude\skills\gcloud-bigquery\scripts\github_search.py" "PETA OHSU" --repo dhs-public-records-filing
python "C:\Users\Big Levi\.claude\skills\gcloud-bigquery\scripts\github_search.py" --list-repos
python "C:\Users\Big Levi\.claude\skills\gcloud-bigquery\scripts\github_search.py" "search" --json
Key repos searched:
- dhs-public-records-filing -- DHS lawsuit filing package, case law, legal research
- Bingaman-Case-Evidence -- Complete evidence archive (court filings, forensics, medical, witness)
- odhs-state-health-law -- Oregon State Plan, Medicare, OAR citations
- oregon-legal-research -- Searchable case law + statutes database
- the-Quarterback -- Research repository

Important: GitHub search only indexes text files (.md, code). It does NOT search inside PDFs. For PDF content, use Evidence Search (Mode 1) instead.
For finding specific case law holdings, citations, and legal arguments from the fully analyzed Oregon public records case law database. This searches 5 local analysis files covering 13 Oregon cases and the AG Public Records Handbook.
When to use: The user asks about case law, legal holdings, attorney fees arguments, burden of proof, stonewalling doctrine, delay vs. denial, or any legal argument that supports the DHS lawsuit. This is the most targeted tool for "what case says X?" questions.
python "C:\Users\Big Levi\.claude\skills\gcloud-bigquery\scripts\case_law_search.py" "search terms"
python "C:\Users\Big Levi\.claude\skills\gcloud-bigquery\scripts\case_law_search.py" "attorney fees" --context 5 # more context lines
python "C:\Users\Big Levi\.claude\skills\gcloud-bigquery\scripts\case_law_search.py" --cases # list all cases with citations
python "C:\Users\Big Levi\.claude\skills\gcloud-bigquery\scripts\case_law_search.py" --list # list analysis files
python "C:\Users\Big Levi\.claude\skills\gcloud-bigquery\scripts\case_law_search.py" "burden" --json # JSON output
Analysis files searched:
- CASE_LAW_STRATEGIC_REVIEW.md -- Master synthesis with 7 pillars, case tiers, strategic recommendations
- case_law_analysis_batch1.md -- Chaimov v. State (COA 2021, SC 2022, DAS 2022)
- case_law_analysis_batch2.md -- PETA v. OHSU, In Defense of Animals, Kessler, Merrick
- case_law_analysis_batch3.md -- Bialostosky, Sause v. Hummel, Upham v. Forster
- case_law_analysis_batch4.md -- Pamplin Media, Pride Disposal, LOC Handbook Ch. 14

Key cases available (with citations):
Examples of when to use case law search: "What case says stonewalling = denial?", "What's the citation for Merrick?", "What does PETA v OHSU say about delay?"
The case law search returns matching lines with context from the analysis files, organized by source file. Use --cases to get a quick reference of all 13 cases with their citations.
Important: Case law search covers the analyzed markdown files, not the raw PDFs. For the original PDF text of the cases, use Evidence Search (Mode 1). For case law research in the GitHub repos (memos, filings), use GitHub Search (Mode 3).
| User says... | Use... | Why |
|---|---|---|
| "What do we have on Emily Cooper?" | Evidence Search | Natural language question about a person |
| "How many documents mention Cooper?" | SQL | Needs an exact count |
| "What medications was Russell given?" | Evidence Search | Investigative question needing synthesis |
| "List all unique drugs_mentioned values" | SQL | Structural query about field values |
| "Search the database for hospice fraud" | Evidence Search | Investigative research question |
| "Show me the table schema" | SQL | Database admin question |
| "What evidence supports the visitation claim?" | Evidence Search | Complex question needing AI reasoning |
| "Count rows in all tables" | SQL | Administrative count |
| "What case law do we have on AG orders?" | GitHub Search | Legal research in repo markdown files |
| "What's in the DHS repo?" | GitHub Search | Repo contents/structure |
| "Find the PETA v OHSU research" | GitHub Search | Previously written research docs |
| "What legal memos exist?" | GitHub Search | Synthesized analysis documents |
| "Search GitHub for fee shifting" | GitHub Search | Explicit GitHub search request |
| "What case says stonewalling = denial?" | Case Law Search | Legal holding/doctrine question |
| "What's the citation for Merrick?" | Case Law Search | Specific case citation lookup |
| "What supports the attorney fees argument?" | Case Law Search | Legal argument support |
| "What's the burden of proof standard?" | Case Law Search | Legal doctrine question |
| "List all the cases we have" | Case Law Search (--cases) | Case inventory request |
| "What does PETA v OHSU say about delay?" | Case Law Search | Specific case analysis question |
When in doubt, use Evidence Search first for factual/investigative questions. Use Case Law Search for legal argument questions ("what case supports...", "what's the holding on...", "what citation do we need for..."). Use GitHub Search for previously written research memos. Fall back to SQL for precise counts.
Every new Claude session that needs BigQuery should start by running the health-check script. This confirms auth is working and shows what's available:
python "C:\Users\Big Levi\.claude\skills\gcloud-bigquery\scripts\bq_startup.py"
This script confirms that auth is working and prints a summary of what's available.
If the startup script prints AUTH_EXPIRED, the key file is missing or corrupted. Tell Levi: "The service account key at C:\Users\Big Levi\.claude\keys\valorinvestigates-bigquery.json is missing or invalid." Do not retry. Do not fall back to local files. Stop.
Auth uses a service account JSON key at C:\Users\Big Levi\.claude\keys\valorinvestigates-bigquery.json. The google-auth library reads the key and mints a fresh token automatically. No gcloud auth login is ever needed.
```python
import json, urllib.request, sys, io

# Force UTF-8 output on Windows consoles
sys.stdout = io.TextIOWrapper(sys.stdout.buffer, encoding='utf-8', errors='replace')

SERVICE_ACCOUNT_KEY = r'C:\Users\Big Levi\.claude\keys\valorinvestigates-bigquery.json'
BQ_API = 'https://bigquery.googleapis.com/bigquery/v2/projects/valorinvestigates/queries'

def get_token():
    import google.oauth2.service_account as sa
    import google.auth.transport.requests as tr
    creds = sa.Credentials.from_service_account_file(
        SERVICE_ACCOUNT_KEY,
        scopes=['https://www.googleapis.com/auth/bigquery']
    )
    creds.refresh(tr.Request())
    return creds.token

def bq_query(sql, token, timeout_ms=60000):
    body = json.dumps({
        'query': sql,
        'useLegacySql': False,
        'timeoutMs': timeout_ms
    }).encode()
    req = urllib.request.Request(BQ_API, data=body, headers={
        'Authorization': f'Bearer {token}',
        'Content-Type': 'application/json'
    })
    resp = urllib.request.urlopen(req)
    return json.loads(resp.read())

token = get_token()
```
If a query returns HTTP 401 or 403, call get_token() again and retry once.
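That retry-once rule can be wrapped in a small helper. This is a sketch: `query_with_retry` is a hypothetical name, and `token_provider` and `query_fn` stand in for the `get_token()` and `bq_query()` functions shown above.

```python
import urllib.error

def query_with_retry(sql, token_provider, query_fn):
    """Run query_fn(sql, token); on HTTP 401/403 mint a fresh token and retry once."""
    try:
        return query_fn(sql, token_provider())
    except urllib.error.HTTPError as e:
        if e.code in (401, 403):
            # Token was stale or rejected: refresh once and retry.
            return query_fn(sql, token_provider())
        raise  # any other HTTP error propagates unchanged
```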
All tables live in the dataset `valorinvestigates.valor_investigations`.

| Table | Rows | Purpose | Key Columns |
|---|---|---|---|
| bingaman_documents | 15,596 | All case files (one row per file) | document_id, file_name, file_path, source_folder, category, file_type, extracted_text, text_length |
| document_chunks | 89,449 | Chunked text for granular search (~500 words each) | chunk_id, doc_id, chunk_index, chunk_text, page_number, word_count |

Empty tables (schema in place, not yet populated):

| Table | Rows | Purpose |
|---|---|---|
| entities | 0 | People, orgs, drugs extracted |
| evidence_facts | 0 | Verified facts with Bates numbers |
| metadata_forensics | 0 | PDF metadata analysis |
| witness_statements | 0 | Witness/insider accounts |
| global_knowledge_graph | 0 | Entity relationships |
| task_queue | 0 | Pending investigation tasks |

Legacy table:

| Table | Rows | Purpose |
|---|---|---|
| documents | 10,599 | Older import with different schema |
Search full document text:

```sql
SELECT document_id, file_name, file_path, category
FROM `valorinvestigates.valor_investigations.bingaman_documents`
WHERE LOWER(extracted_text) LIKE LOWER('%search term%')
```
Search chunks, joined back to their parent documents:

```sql
SELECT dc.chunk_id, dc.chunk_text, dc.page_number, d.file_name
FROM `valorinvestigates.valor_investigations.document_chunks` dc
JOIN `valorinvestigates.valor_investigations.bingaman_documents` d
  ON dc.doc_id = d.document_id
WHERE LOWER(dc.chunk_text) LIKE LOWER('%search term%')
ORDER BY d.file_name, dc.chunk_index
```
Count documents per category:

```sql
SELECT category, COUNT(*) AS doc_count
FROM `valorinvestigates.valor_investigations.bingaman_documents`
GROUP BY category
ORDER BY doc_count DESC
```
Pull the full text of one document:

```sql
SELECT document_id, file_name, extracted_text
FROM `valorinvestigates.valor_investigations.bingaman_documents`
WHERE document_id = 'THE_DOC_ID'
```
For UPDATE and other DML statements, use a longer timeout (120000 ms):

```python
sql = """UPDATE `valorinvestigates.valor_investigations.bingaman_documents`
SET category = 'new_category'
WHERE document_id = 'some_id'"""
result = bq_query(sql, token, timeout_ms=120000)
```
BigQuery paginates results. If totalRows is larger than what you received, use the pageToken:
```python
result = bq_query(sql, token)
all_rows = result.get('rows', [])
while 'pageToken' in result:
    page_token = result['pageToken']
    job_id = result['jobReference']['jobId']
    page_url = f'https://bigquery.googleapis.com/bigquery/v2/projects/valorinvestigates/queries/{job_id}?pageToken={page_token}'
    req = urllib.request.Request(page_url, headers={
        'Authorization': f'Bearer {token}',
        'Content-Type': 'application/json'
    })
    result = json.loads(urllib.request.urlopen(req).read())
    all_rows.extend(result.get('rows', []))
```
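BigQuery's REST responses nest each row as `{'f': [{'v': value}, ...]}` with the column names listed separately in `result['schema']['fields']`. A small helper (hypothetical name) can flatten the accumulated rows into plain dicts:

```python
def rows_to_dicts(result):
    """Flatten a BigQuery jobs.query response into a list of {column: value} dicts."""
    names = [f['name'] for f in result.get('schema', {}).get('fields', [])]
    out = []
    for row in result.get('rows', []):
        cells = [cell.get('v') for cell in row.get('f', [])]
        out.append(dict(zip(names, cells)))
    return out
```

Note that the REST API returns every value as a string (e.g. counts arrive as `'42'`), so cast numeric columns yourself if you need them as numbers.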
| Error | Meaning | Action |
|---|---|---|
| HTTP 401 | Token stale | Call get_token() again and retry once |
| HTTP 403 | Permission denied | Check project/dataset name, tell user |
| HTTP 404 | Table not found | Check table name against schema reference |
| HTTP 400 | Bad SQL | Fix the query syntax |
| AUTH_EXPIRED | Key file missing/bad | Tell user key is missing or corrupted |
| Timeout | Query too slow | Increase timeoutMs or simplify query |
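The HTTP rows of the table above can be encoded as a lookup so a script reports the recommended action directly. A sketch: `describe_bq_error` is a hypothetical helper, and the messages paraphrase the table.

```python
import urllib.error

# Recommended actions, keyed by HTTP status (paraphrasing the table above).
ACTIONS = {
    401: "Token stale -- call get_token() again and retry once",
    403: "Permission denied -- check project/dataset name, tell user",
    404: "Table not found -- check table name against schema reference",
    400: "Bad SQL -- fix the query syntax",
}

def describe_bq_error(err: urllib.error.HTTPError) -> str:
    """Translate an HTTPError status into the recommended action."""
    return ACTIONS.get(err.code, f"Unhandled HTTP {err.code} -- surface the raw error")
```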
| Repo | Description | Key Content |
|---|---|---|
| dhs-public-records-filing | DHS lawsuit filing package | Complaint, exhibits 1-13, case law (13 Oregon cases), legal research memos |
| Bingaman-Case-Evidence | Complete evidence archive | Court filings, metadata forensics, medical records, witness declarations |
| odhs-state-health-law | Oregon State Plan research | SPPC/PC20, Medicare, OAR citations, eligibility rules |
| oregon-legal-research | Case law + statutes DB | Searchable Oregon guardianship/probate law |
| the-Quarterback | Research repository | General investigation research |
Final reminders:

- Always use standard SQL (`useLegacySql: false`).
- Fully qualify table names: `valorinvestigates.valor_investigations.tablename`.
- Put `sys.stdout = io.TextIOWrapper(sys.stdout.buffer, encoding='utf-8', errors='replace')` at the top of any Python script to avoid Windows encoding errors.
- Query `bingaman_documents` (15,596 rows), not the older `documents` table (10,599 rows).
- Run case_law_search.py with `--cases` for a quick reference of all cases.