Machine learning for federal missions.

Production ML systems from a Kaggle Top 200 data scientist. Computer vision, NLP, forecasting, anomaly detection — shipped into live federal environments, not slide decks.

Top 200
Kaggle Global
0.1%
Of 200,000+ DS
5+
Federal ML Systems
15yr
Engineering

What we deliver

Most federal ML work stalls in the same place: a promising notebook on a laptop that never becomes a production system. We close that gap. Every engagement is scoped toward a working model serving real users, not a PowerPoint deck.

  • Computer vision — object detection, segmentation, OCR, document understanding, satellite/aerial imagery analysis, medical imaging.
  • Natural language processing — classification, named entity recognition, summarization, embeddings-based search, fine-tuned domain models.
  • Time-series forecasting — demand, supply, workforce, equipment, budget. Classical (Prophet, ARIMA) through deep (N-BEATS, TFT).
  • Anomaly detection — fraud, network intrusion, supply chain deviation, health surveillance, predictive maintenance.
  • Tabular prediction — gradient-boosted trees (XGBoost, LightGBM, CatBoost) plus ensembling. The unglamorous workhorse that wins most federal problems.
  • MLOps — model registry, drift monitoring, shadow deployment, A/B testing, rollback paths.

Why Kaggle matters for federal ML

A Kaggle Top 200 ranking means you've consistently beaten 199,800+ other data scientists on held-out test sets across diverse problem types. It is a direct publicly-benchmarked, adversarial test of modeling skill that exists.

For federal work, it matters because most federal ML projects fail not from lack of data but from modeling choices — wrong architecture, leaked features, overfit validation, no uncertainty quantification. Competition ML training directly addresses these failure modes.

Stack

  • Modeling: PyTorch, scikit-learn, XGBoost, LightGBM, CatBoost, HuggingFace Transformers, timm, detectron2.
  • Experiment tracking: MLflow, Weights & Biases.
  • Serving: TorchServe, ONNX Runtime, Triton Inference Server, FastAPI wrappers.
  • Feature stores & data: Feast, Parquet + DuckDB, Postgres, Spark for batch feature engineering.
  • Cloud: AWS GovCloud (SageMaker, EC2 GPU), Azure Government, on-prem CUDA clusters.

Past performance highlights

SAMHSA

Production machine learning system

Designed and deployed a live machine learning system at the Substance Abuse and Mental Health Services Administration. Serves real users. Passed full federal security review. In production today. See full past performance →

Federal ML, answered.
What machine learning work can a small business realistically deliver for federal agencies?

Everything a prime can deliver for the actual ML workload — we've shipped production ML at SAMHSA that passed federal security review. What a small business does not deliver is the 200-person prime overhead wrapped around a 2-person ML team. For the ML itself, a lean specialist delivers faster and with more direct accountability.

What is your Kaggle ranking?

Top 200 globally out of 200,000+ active data scientists, putting Bo in the top 0.1%. Experience spans NLP, computer vision, tabular prediction, and ensemble methods — all competition-proven on held-out adversarial test sets. Profile: kaggle.com/bopengiowa.

Have you shipped production ML in a federal environment?

Yes. Production ML at SAMHSA (Substance Abuse and Mental Health Services Administration). Live. Real users. Passed federal security review. Not a prototype, not a POC.

Can you handle classified or air-gapped ML environments?

Design and modeling yes. Production deployment in classified environments requires the contracting path to include facility clearance — typically through a cleared prime partner. We'd partner rather than pretend to hold clearances we don't.

Do you do MLOps and monitoring, or just model training?

Both. A federal ML system without drift monitoring, model registry, and rollback paths is a liability. We build these in from day one, not bolt them on later.

Often deployed together.
1 business day response

Turn data into mission advantage.

Production ML for federal agencies. Ready to deliver.

[email protected]
UEI Y2JVCZXT9HP5CAGE 1AYQ0NAICS 541512SAM.GOV ACTIVE