I build production-grade, safety-first AI systems and translate them into clear business & clinical outcomes.
Focus areas: Advanced multi‑agent platforms,RL powered LLMs, RAG/LLMOps, healthcare, Fintech, Insuretech & life sciences, multilingual/edge AI.
Contact: [email protected] • LinkedIn • GitHub @invincible-jha
- Builder: Architect & ship agentic systems, hybrid RAG, long‑context/tool‑calling LLMs, speech/vision pipelines, and compliant MLOps.
- Translator: Turn “models” into KPI‑positive capabilities tied to ROI, governance, and adoption playbooks.
Operating model: Shadow → Guarded‑assist → Full assist → Automate • Cite‑or‑abstain grounding • OpenTelemetry/rollback • KPI‑first with N & 95% CI.
- 22 production systems • 6 agentic platforms • 15 healthcare/biomed programs • 14+ years building AI
AI/ML: ASR/NLP/NMT (wav2vec2, Conformer, RNN‑T) • CV/ViT • GNNs • Diffusion • MARL • Causal inference
GenAI/RAG: Tool‑calling LLMs • 32–128k context • hybrid retrieval (sparse+dense+cross‑encoder) • evaluator suites
MLOps/LLMOps: OTel tracing • blue/green • drift/rollback • token/latency budgets • versioned data/prompts/models
Compliance/Safety: HIPAA/GDPR/IVDR/MDR • consent ledgers • immutable audit logs • conformal risk bands
-
CX Positive Behavior Shift (2019–2025)
Planner/Orchestrator • Policy guardrails • Tool‑calling LLM • Hybrid RAG • Long‑context
Impact: AHT↓ • CSAT↑ • Policy violations↓; cohort A/B with significance. -
Rights‑Aware Expert Knowledge Base (2022–2025)
Crawler→OCR/ASR→Deduper→Attribution guard→Contradiction detector • Cite‑always prompts
Impact: Retrieval P@k↑ • Citation coverage↑ • Contradiction SLA. -
EU Regulatory Advanced RAG (2022–2025)
Clause anchors • Cross‑encoder rerank • Control mapping • Gap analysis • Hash‑chained evidence
Impact: Mapping F1↑ • Audit duration↓ • Replayable evidence packs. -
Mental‑Health Digital Twin Companion (2022–2025)
Conformal risk bands • On‑device modes • Federated personalization • Safety escalation
Impact: Time‑to‑escalation↓ • Adherence↑ • FN/FP tracked. -
GCC Youth Soccer Discovery (2022–2025)
OC‑SORT/ByteTrack MOT • TimeSformer/ViViT events • Scout assistant
Impact: IDF1↑ • Event mAP↑ • High scout agreement κ. -
Health & Wellness Twins (2022–2025)
Environment/Resident twins • MPC/safe‑RL HVAC • Matter/Thread • Explainable automations
Impact: IAQ time‑in‑range↑ • Sleep metrics↑ • Low rollback rate.
The six agentic platforms above mirror the structure/timelines in your master portfolio. :contentReference[oaicite:4]{index=4}
-
Pucho Multilingual Assistant (2014–2020)
GMM/DNN‑HMM → CTC/attention → RNN‑T • mBERT/XLM‑R NLU • TFLite quantization • USSD/SMS
Impact: Millions of users • WER↓ • High intent accuracy across low‑resource languages. -
DARPA Humanoid Robotics (2012–2015)
ROS • LIDAR/IMU fusion • CNN detection • A*/RRT • Whole‑body QP control
Impact: Robust sim‑to‑real task completion in hazardous settings. -
Pucho Inc. Next‑Gen Multilingual Platform (2020– )
wav2vec2/Conformer/RNN‑T • XLM‑R/mT5 adapters • Federated learning • Device‑tier routing
Impact: Better code‑switch NLU • Latency/cost SLOs • Fairness dashboards. -
AI Diagnostics — NeuroScan (2021–2025)
nnU‑Net → 3D ViT • MC‑dropout • Conformal prediction • DICOMweb
Impact: Time‑to‑read↓ • High sensitivity/specificity • Well‑calibrated ECE. -
GenomicCare — Personalized Treatment (2021–2025)
Multimodal transformers (genomic+clinical+imaging) • Survival heads • Uplift modeling • SHAP
Impact: Strong C‑index/AUC • Adverse events↓ • Subgroup parity. -
VitalChain — Secure Health Data Exchange (2021–2025)
Hyperledger Fabric • SMART/Bulk FHIR • ZK proofs • HSM/KMS • PQ‑crypto roadmap
Impact: Consent provenance • Fast audit replay • TPS/latency within SLA. -
MOLECULE‑X — AI Drug Discovery (2020–2025)
Message‑passing GNNs • 3D/diffusion generators • Retrosynthesis • Pareto optimization
Impact: Hit‑rate↑ • Diversity↑ • Tox liabilities↓; wet‑lab validation. -
COVID‑19 Response & DIYA Vocal Biomarkers (2020–2022)
Docking + ML rescoring • wav2vec2 encoders • Calibrated risk bands (research, non‑diagnostic)
Impact: Faster screening throughput • Strong triage AUC; privacy controls. -
Next‑Gen mRNA Design (2020–2025)
Seq‑to‑expression transformers • Epitope binding • Structure/manufacturability constraints
Impact: Top‑k enrichment↑ • Expression↑ • Yield↑; candidates advanced. -
Global Health Access — Edge AI (2020–2025)
Sub‑100MB SLMs • Int8/fp16 • Offline queues • Federated pilots
Impact: Referral lag↓ • Adherence↑ • Rugged, multilingual deployments. -
Multi‑Agent Healthcare Operations (2021–2025)
QMIX/MA‑PPO • Safe‑RL • OPE (IPS/DR) • Queueing sims
Impact: Throughput↑ • Time‑to‑diagnosis↓ • Fairness dashboards. -
Domain LLMs for Biomedical Research (2021–2025)
Long‑context • Hybrid/Graph‑RAG • Citation‑first prompting • Evaluators
Impact: Hallucination↓ • Citation coverage↑ • Reviewer agreement↑. -
Small Language/Signal Models for Edge Healthcare (2021–2025)
Distillation/quantization • NNAPI/Metal • Conformal triage
Impact: Low p95 latency • Minimal battery impact • On‑device privacy. -
Quantum‑Enhanced Discovery (2021–2025)
QUBO/QAOA/VQE subroutines • Error mitigation • Classical fallbacks
Impact: Measured enrichment vs cost • Reproducible experiments. -
Decentralized Health Data Management (2021–2025)
SMART/Bulk FHIR • Selective disclosure • Anomaly detection • Residency controls
Impact: TPS/latency SLAs • Replayable audits • Strong DSAR flows.
The fifteen programs above follow your wording and impact focus as captured in the consolidated portfolio. :contentReference[oaicite:5]{index=5}
Clients masked for public GitHub. Outcomes are headline deltas; full N & 95% CI available privately.
- Tier‑1 Insurer (2017): AI‑powered underwriting document extraction — 45% cycle‑time reduction.
- Tier‑1 Insurer (2017): Voice‑AI form automation — 35% AHT reduction.
- Tier‑1 Telecom ( 2018): Indic‑language voice AI — 32% first‑contact‑resolution improvement.
- Micro‑Insurance Provider ( 2018): Indian‑language automation — 40% conversion increase.
- Retail Real‑Estate Group (2018): Mall voice assistants — 55% promo‑engagement uplift.
- Mobile OS Distribution (2019): System‑level voice automation — p95 latency ≈ 500 ms.
- Smartphone OEM (2019–2020): On‑device voice query — model footprint <100 MB.
- Retail Tech ISV (APAC, 2016): Revenue optimization — MAPE reduction 30%.
Language packs used across deployments: English, Hindi, Bengali, Tamil, Kannada, Marathi & European Languages (add others as applicable).
- 2014–2018: GMM/DNN‑HMM → CTC/attention; phrase‑based SMT → attention NMT; Kaldi/CMU Sphinx; early CNNs.
- 2019–2022: mBERT/XLM‑R, RNN‑T, wav2vec2/Conformer; long‑context transformers; TFLite/ONNX; federated pilots.
- 2023–2025: Tool‑calling LLMs, hybrid/Graph‑RAG, evaluator suites; conformal prediction; agentic planners; ZK proofs.
Governance throughout: consent/audit ledgers, cite‑or‑abstain, OpenTelemetry traces, blue/green + rollback.
Open to co‑building responsible agentic systems in healthcare/Cybersecurity/Fintech/Insuretech/Enterprise Saas/regulated domains, and to advisory/fractional roles establishing evaluation, safety, and LLMOps standards.