Projects & Impact

NL2Insights: Enterprise Text-to-SQL at Scale

Overview: NL2Insights is a fully automated pipeline that converts natural-language questions into SQL queries. It powers flagship IBM products and changes how enterprises interact with their data.

Impact:

Key Innovations:

Recognition:


BIRD Leaderboard: #1 in Text-to-SQL Benchmark

Achievement: Led IBM Granite Text-to-SQL models to first place in both tracks of the prestigious BIRD (BIg Bench for LaRge-scale Database Grounded Text-to-SQL Evaluation) leaderboard.

Challenge: BIRD comprises 12,751 question-SQL pairs across 95 databases spanning 37 professional fields, and it emphasizes both accuracy and execution efficiency.
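To illustrate what execution-based accuracy means in this setting, here is a minimal sketch that runs a predicted query and a reference query against the same database and compares their result sets. It is not the official BIRD evaluation script; the `execution_match` helper, the `employees` table, and the sample queries are all hypothetical and exist only for illustration.

```python
import sqlite3


def execution_match(conn, predicted_sql, gold_sql):
    """Return True if both queries yield the same set of rows.

    Hypothetical helper: a predicted query that fails to execute
    counts as a miss.
    """
    try:
        predicted_rows = conn.execute(predicted_sql).fetchall()
    except sqlite3.Error:
        return False
    gold_rows = conn.execute(gold_sql).fetchall()
    # Compare as sets, so row order and duplicate rows are ignored.
    return set(predicted_rows) == set(gold_rows)


# Toy in-memory database (illustrative data, not from BIRD).
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE employees (name TEXT, dept TEXT, salary INTEGER);
    INSERT INTO employees VALUES
        ('Ana', 'Sales', 90), ('Ben', 'Sales', 70), ('Cy', 'Ops', 80);
    """
)

gold = "SELECT name FROM employees WHERE salary > 75"
equivalent = "SELECT name FROM employees WHERE salary >= 80"
wrong = "SELECT name FROM employees WHERE dept = 'Sales'"

print(execution_match(conn, equivalent, gold))
print(execution_match(conn, wrong, gold))
```

Comparing results rather than SQL text means that two differently written queries count as equal whenever they return the same rows on the given database.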

Why It Matters:

Technical Approach:

Recognition:


Multilingual Text2SQL

Capability: Extended Text2SQL to support multiple languages, making data accessible to a global workforce.

Deployment:

Impact:


AutoDO: Automated Decision Optimization

Overview: An end-to-end automated system for solving sequential decision-making problems using data-driven and knowledge-driven approaches.

Contributions:

Timeline: Jan. 2022 – Mar. 2023

Recognition: Tutorial/Lab organizer at AAAI 2023: “Automated AI For Decision Optimization with Reinforcement Learning”


Evaluating Robustness in Multi-Agent Reinforcement Learning

Innovation: Proposed the first model-based adversarial attacks (cMBA) for cooperative multi-agent reinforcement learning.

Key Contributions:

Publication: IEEE International Conference on Data Mining (ICDM) 2023

Patent: Filed a patent application on a systematic approach for evaluating robustness (Sep. 2022)


Federated Learning with Douglas-Rachford Splitting

Innovation: Proposed FedDR and asyncFedDR algorithms for federated learning with best-known communication complexity.

Key Features:

Publication: NeurIPS 2021 (35th Conference on Neural Information Processing Systems)

Impact: Advanced the state of the art in federated optimization for non-convex problems
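For readers unfamiliar with the splitting scheme named in this project's title, the sketch below shows the classical Douglas-Rachford iteration on a small convex toy problem: minimize 0.5 * ||x - a||^2 + lam * ||x||_1. This is not FedDR or asyncFedDR, which target federated, non-convex settings. The function names, the toy objective, and the parameter values are assumptions chosen only for illustration.

```python
def soft_threshold(v, t):
    """Proximal operator of t * |.| applied to a scalar v."""
    if v > t:
        return v - t
    if v < -t:
        return v + t
    return 0.0


def douglas_rachford(a, lam, gamma=1.0, iters=100):
    """Classical Douglas-Rachford splitting for
    min_x 0.5 * ||x - a||^2 + lam * ||x||_1 (toy problem, pure Python).
    """
    z = [0.0] * len(a)
    for _ in range(iters):
        # x = prox of the quadratic term at z.
        x = [(zi + gamma * ai) / (1.0 + gamma) for zi, ai in zip(z, a)]
        # y = prox of the l1 term at the reflected point 2x - z.
        y = [soft_threshold(2.0 * xi - zi, gamma * lam) for xi, zi in zip(x, z)]
        # Update the auxiliary variable.
        z = [zi + yi - xi for zi, yi, xi in zip(z, y, x)]
    # The solution estimate is the prox of the quadratic term at the final z.
    return [(zi + gamma * ai) / (1.0 + gamma) for zi, ai in zip(z, a)]


a = [3.0, -0.5, 1.2]
solution = douglas_rachford(a, lam=1.0)
print(solution)
```

For this toy objective the minimizer has a closed form, the soft-thresholding of `a` at `lam`, which makes the iteration easy to check.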


Patents & Intellectual Property

Filed 9 patent applications on:


Skills & Technologies

Large Language Models: Fine-tuning, prompt engineering, reasoning systems, RAG (Retrieval-Augmented Generation)

Programming: Python, TensorFlow, Keras, PyTorch, Scikit-learn, C/C++, MATLAB

ML/AI: Deep learning, reinforcement learning, federated learning, optimization algorithms

Databases: SQL, Text-to-SQL, schema linking, query optimization

Enterprise AI: Production deployment, scalability, safety & security, multilingual systems