CV

William (Bill) Watson

https://nextbillyonair.github.io

New York City, NY, US

Summary

Research on autonomous, adaptive AI agents for asymmetric environments that personalize at test-time, with a focus on privacy, learning from limited feedback, dynamic memory systems, and maintaining diverse, uncorrelated behaviors.

Education

M.S.E. in Computer Science
2019-05-01
The Johns Hopkins University
Courses: Computer Vision, Machine Translation, Human-Robot Interaction, Deep Learning, Artificial Intelligence
B.S. in Computer Science; Minor in Applied Mathematics & Statistics
2018-05-01
The Johns Hopkins University
Courses: Computer Vision, Machine Translation, Human-Robot Interaction, Deep Learning, Artificial Intelligence

Work Experience

AI Scientist, Core Artificial Intelligence Technology
2024-06-01 -
Millennium Management
Agentic systems, retrieval, memory, and evaluation for enterprise LLM platforms.
AI Research Lead & Vice President, AI Research (Supervisor: Manuela Veloso)
2021-12-01 - 2024-05-01
JPMorgan Chase & Co.
LLM/agent research for legal, retrieval, and reliability systems.
- Developed HalluciBot (AAAI 2025 Oral) for pre-generation hallucination-risk prediction and routing.
- Created LAW (COLING 2025 Oral; U.S. 12,298,970) for interactive querying over legal contracts; powered by FlowMind (ICAIF 2023).
- Introduced HiddenTables (EMNLP 2023) and developed BizGraphQA (SIGIR 2023).
Data Scientist, Ratings Data Science
2019-06-01 - 2021-11-01
S&P Global
Multimodal document understanding, semantic search, and NLP research for financial/risk workflows.
- Built a tabular auto-extractor (ICAIF 2020) to structure tables from image documents.
- Patented a ratings criteria semantic search & citation ranking engine (U.S. Patent 11,328,022).
- NLP research on earnings calls for Women CEOs During COVID-19 (AZBEES Gold Regional Award).

Skills

Programming

Python
LaTeX
SQL
JavaScript
HTML/CSS
Java
C/C++
OCaml

Machine Learning & Data Science

PyTorch
TensorFlow
Transformers
OpenCV
Scikit-Learn
Tesseract OCR

Cloud, Web, & DevOps

MCP
AWS
Docker
Redis
FastAPI
React
Streamlit
Git
Gunicorn
Plotly
OpenSearch/Elasticsearch

Research Interests

Reinforcement Learning
Retrieval-Augmented Generation (RAG)
Interpretability
Embodied AI
Agents

Publications

Is There No Such Thing as a Bad Question? H4R: HalluciBot For Ratiocination, Rewriting, Ranking, and Routing
2024
AAAI 2025 (Oral - Top 4%)
HalluciBot predicts hallucination risk before LLM generation and routes/rewrites queries for safer reasoning.
View Publication
LAW: Legal Agentic Workflows for Custody and Fund Services Contracts
2024
COLING 2025 (Oral - Top 15%)
Agentic workflow system for interactive querying over legal contracts.
View Publication
FISHNET: Financial Intelligence from Sub-querying, Harmonizing, Neural-Conditioning, Expert Swarms, and Task Planning
2024
ICAIF 2024
Modular system combining sub-querying and expert swarms for financial intelligence tasks.
View Publication
HiddenTables & PyQTax: A Cooperative Game and Dataset For TableQA to Ensure Scale and Data Privacy Across a Myriad of Taxonomies
2024
EMNLP 2023
Privacy-preserving TableQA game/dataset; improves discovery and highlights schema/compositional weaknesses.
View Publication
FlowMind: Automatic Workflow Generation with LLMs
2024
ICAIF 2023 (Best Poster Runner-Up)
Automatically generates and refines agentic workflows with LLMs.
View Publication
BizGraphQA: A Dataset for Image-based Inference over Graph-structured Diagrams from Business Domains
2023
SIGIR 2023
Dataset for question answering over graph-structured business diagrams.
View Publication
Financial Table Extraction in Image Documents
2024
ICAIF 2020 (Oral)
Deep learning pipeline for detecting and transcribing financial tables in images while preserving structure.
View Publication
Modeling Color Terminology Across Thousands of Languages
2019
EMNLP-IJCNLP 2019
Cross-linguistic modeling of color terminology across many languages.
View Publication
QueryBandits for Hallucination Mitigation: Exploiting Semantic Features for No-Regret Rewriting
2025
NeurIPS 2025 LAW (Workshop)
Contextual bandits for adaptive query rewriting to mitigate hallucinations.
View Publication
MultiQ&A: An Analysis in Measuring Robustness via Automated Crowdsourcing of Question Perturbations and Answers
2025
AAAI 2025 PDLM (Workshop - Oral)
Robustness analysis using automated perturbations and answer collection.
View Publication
BuDDIE: A Business Document Dataset for Multi-task Information Extraction
2024
COLING 2025 FinNLP+FNP+LLMFinLegal (Workshop - Oral)
Multi-task information extraction dataset for business documents.
View Publication
Directed Criteria Citation Recommendation and Ranking Through Link Prediction
2024
ICAIF 2020 (Extended Abstract)
Link prediction approach for recommending and ranking financial citations.
View Publication
Customer Experience Focus Can Improve Equity And Credit Performance
2021
S&P Global Special Reports
Report on the relationship between customer experience focus and equity/credit performance.
View Publication
Leadership In Turbulent Times: Women CEOs During COVID-19
2022
S&P Global Special Reports (AZBEES Gold Regional Award)
Analysis of leadership strategies of women CEOs during COVID-19 using earnings call data.
View Publication
Development of Computer Vision and Image Processing Libraries at the National Synchrotron Light Source - II
2016
BNL Internship Reports 2016 (U.S. DOE, Brookhaven National Laboratory)
CV/ML library development for synchrotron imaging and scientific data processing.
View Publication
QUILL: Questions into Layout & Logic, or Learning How to Ask-Then-Draw for Guided Infographic Generation with Controllable Diffusion
2026
ACL 2026 (Target Submission)
Guided infographic generation from QA-derived layout and logic constraints.
View Publication
Stop Agreeing: Constrained GRPO with Answerer Ensembles for Sycophancy-Resistant Two-Stage QA
2026
ACL 2026 (Target Submission)
Constrained GRPO approach to reduce sycophancy and prompt-lottery variance in two-stage QA.
View Publication
Keeping TABs: Genetic Memory for Online Continual Learning with Thompson-Sampled Agentic Buffers
2026
ACL 2026 (Target Submission)
Thompson-sampled agentic buffers for genetic memory in online continual learning.
View Publication
No One Size Fits All: QueryBandits for Hallucination Mitigation
2026
ICLR 2026 (Under Review)
Online per-query rewrites that adapt on the fly to reduce hallucinations.
View Publication
What Makes a Good Query? Measuring the Impact of Human-Confusing Linguistic Features on LLM Performance
2026
EACL 2026 (Under Review)
Measures how human-confusing linguistic features affect LLM performance.
View Publication
TASER: Table Agents for Schema-guided Extraction and Recommendation
2025
EACL 2026 (Under Review)
Schema-guided table agents for extraction and recommendation in structured domains.
View Publication
System and Method for Generating Order Agnostic Dynamic Citation Network
2025
U.S. Patent
U.S. Patent 12,443,662 • Filed June 14, 2024 • Issued October 14, 2025 • JPMorgan Chase & Co.
View Publication
Method and System for Automated Processing and Continuous Deployment of Subpoena Extraction Model
2025
U.S. Patent
U.S. Patent 12,437,572 • Filed November 21, 2022 • Issued October 7, 2025 • JPMorgan Chase & Co.
View Publication
Method and System for Automatic Workflow Generation by Large Language Models
2025
U.S. Patent
U.S. Patent 12,399,691 • Filed August 24, 2023 • Issued August 26, 2025 • JPMorgan Chase & Co.
View Publication
Method and System for Analyzing Natural Language Data by Using Domain-Specific Language Models
2025
U.S. Patent
U.S. Patent 12,298,970 • Filed July 3, 2023 • Issued April 30, 2025 • JPMorgan Chase & Co.
View Publication
System for Document Ranking by Phrase Importance
2022
U.S. Patent
U.S. Patent 11,328,022 • Filed March 17, 2020 • Issued April 20, 2022 • S&P Global
View Publication
Method and System for Information Extraction and Aggregation
2024
U.S. Patent Application
U.S. Patent App. 18/816,826 • Filed August 27th, 2024 • JPMorgan Chase & Co.
View Publication
Method & System of Training an Encoder Classifier Model in Predicting Hallucination of a Machine Learning Model before Generation of a Query
2024
U.S. Patent Application
U.S. Patent App. 18/806,279 • FiledAugust15th,2024 • JPMorganChase&Co.
View Publication
System and Method for Automatic Table Identification and Extraction in Documents
2024
U.S. Patent Application
U.S. Patent App. 18/767,115 • Filed July 9, 2024 • JPMorgan Chase & Co.
View Publication
Method and System for Improving Code Generation Quality of Large Language Model Through Code Guardrails
2024
U.S. Patent Application
U.S. Patent App. 18/675,688 • Filed May 28, 2024 • JPMorgan Chase & Co.
View Publication
System and Method for Vision-Assisted Approach for Graph Structured Extraction In Various Types of Documents
2024
U.S. Patent Application
U.S. Patent App. 18/666,318 • Filed May 16, 2024 • JPMorgan Chase & Co.
View Publication
System and Method for Implementing a Model That Predicts the Probability of Hallucination For Any Query Imposed to an LLM
2024
U.S. Patent Application
U.S. Patent App. 18/630,641 • Filed April 9, 2024 • JPMorgan Chase & Co.
View Publication
Method and System for Performing Table Question Answering Tasks While Preserving Data Privacy
2023
U.S. Patent Application
U.S. Patent App. 18/535,428 • Filed December 11, 2023 • Allowed November 21, 2025 • JPMorgan Chase & Co.
View Publication
Method and System for Evaluating Artificial Intelligence Models Via Perturbations
2023
U.S. Patent Application
U.S. Patent App. 18/396,014 • Filed December 26, 2023 • JPMorgan Chase & Co.
View Publication
Method and System for Forecasting Trading Behavior and Thematic Concepts
2023
U.S. Patent Application
U.S. Patent App. 18/377,121 • Filed October 5, 2023 • JPMorgan Chase & Co.
View Publication
Method and System for Forecasting Market Activity
2023
U.S. Patent Application
U.S. Patent App. 18/377,116 • Filed October 5, 2023 • JPMorgan Chase & Co.
View Publication
Method and System for Code Generation by Large Language Models
U.S. Patent Application
U.S. Patent App. 18/205,719 • Prov. Filed March 22, 2023 • Allowed December 3, 2025 • JPMorgan Chase & Co.
View Publication
Method and System for Generation of Insights from Regulatory Filings
2023
U.S. Patent Application
U.S. Patent App. 18/103,807 • Filed January 31, 2023 • JPMorgan Chase & Co.
View Publication
Method and System for Automation of Due Diligence
2022
U.S. Patent Application
U.S. Patent App. 17/658,383 • Filed April 7, 2022 • JPMorgan Chase & Co.
View Publication

Download CV as PDF View Markdown CV