Machine learning for federal missions.

Production ML systems from a Kaggle Top 200 data scientist. Computer vision, NLP, forecasting, anomaly detection — shipped into live federal environments, not slide decks.

Discuss your data See past performance

Top 200

Kaggle Global

0.1%

Of 200,000+ DS

a federal health agency

Production ML, in production today

15yr

Founder engineering tenure

What we deliver

Every engagement is scoped toward a working model serving real users, not a PowerPoint deck — from data intake through deployed inference with monitoring, ATO alignment, and a handoff package the receiving team can operate.

Top 200

Kaggle global ranking

a federal health agency

Confirmed production ML delivery

FedRAMP High

ATO-ready ML infrastructure

MACHINE LEARNING — what we track

p99

latency budget

eval

domain harness

ATO

NIST 800-53

cost

per mission call

drift

continuous eval

Computer vision — object detection, segmentation, OCR, document understanding, satellite/aerial imagery analysis, medical imaging.
Natural language processing — classification, named entity recognition, summarization, embeddings-based search, fine-tuned domain models.
Time-series forecasting — demand, supply, workforce, equipment, budget. Classical (Prophet, ARIMA) through deep (N-BEATS, TFT).
Anomaly detection — fraud, network intrusion, supply chain deviation, health surveillance, predictive maintenance.
Tabular prediction — gradient-boosted trees (XGBoost, LightGBM, CatBoost) plus ensembling. The workhorse pattern behind most production federal prediction systems.
MLOps — model registry, drift monitoring, shadow deployment, A/B testing, rollback paths.

FEDERAL ML DEPLOYMENT COVERAGE

Scikit-learn and XGBoost on GovCloud

92%

PyTorch (on-prem and GovCloud)

88%

SageMaker Government

85%

Azure ML Government

82%

Databricks MLflow on GovCloud

88%

Why Kaggle matters for federal ML

A Kaggle Top 200 ranking means you've consistently beaten 199,800+ other data scientists on held-out test sets across diverse problem types. It is a direct publicly-benchmarked, adversarial test of modeling skill that exists.

For federal work, it matters because many federal ML projects fall short not from lack of data but from modeling choices — wrong architecture, leaked features, overfit validation, no uncertainty quantification. Competition ML training directly addresses these failure modes.

Stack

Modeling

PyTorch, scikit-learn, XGBoost, LightGBM, CatBoost, HuggingFace Transformers, timm, detectron2.

Experiment tracking

MLflow, Weights & Biases.

Serving

TorchServe, ONNX Runtime, Triton Inference Server, FastAPI wrappers.

Feature stores & data

Feast, Parquet + DuckDB, Postgres, Spark for batch feature engineering.

Cloud

AWS GovCloud (SageMaker, EC2 GPU), Azure Government, on-prem CUDA clusters.

Past performance highlights

a federal health agency

Production machine learning system

Designed and deployed a live machine learning system at the a federal health agency. Serves real users. Passed full federal security review. In production today. See full past performance →

Frequently Asked

Federal ML, answered.

What machine learning work can a small business realistically deliver for federal agencies?

Everything a prime can deliver for the actual ML workload — we've shipped production ML at a federal health agency that passed federal security review. What a small business does not deliver is the 200-person prime overhead wrapped around a 2-person ML team. For the ML itself, a lean specialist delivers faster and with more direct accountability.

What is your Kaggle ranking?

Top 200 globally out of 200,000+ active data scientists, putting Bo in the top 0.1%. Experience spans NLP, computer vision, tabular prediction, and ensemble methods — all competition-proven on held-out adversarial test sets. Profile: kaggle.com/bopengiowa.

Have you shipped production ML in a federal environment?

Yes. Production ML at a federal health agency (a federal health agency). Live. Real users. Passed federal security review. Not a prototype, not a POC.

Can you handle classified or air-gapped ML environments?

Design and modeling yes. Production deployment in classified environments requires the contracting path to include facility clearance — typically through a cleared prime partner. We'd partner rather than pretend to hold clearances we don't.

Do you do MLOps and monitoring, or just model training?

Both. A federal ML system without drift monitoring, model registry, and rollback paths is a liability. We build these in from day one, not bolt them on later.

Related capabilities

Often deployed together.

1 business day response

Turn data into mission advantage.

Production ML for federal agencies. Ready to deliver.

Contact the PI See which agencies we serve →

UEI Y2JVCZXT9HP5CAGE 1AYQ0NAICS 541512SAM.GOV ACTIVE

Machine learning for federal missions.

What we deliver

Why Kaggle matters for federal ML

Stack

Past performance highlights

Production machine learning system

Agentic AI & LLM Systems

Data Engineering

Cloud Infrastructure

Turn data into mission advantage.