William Foland, PhD

GenAI Scientist & Architect

I turn complex human workflows into deterministic AI engines — from architecture through production delivery. PhD in neural language understanding, 19 US patents, 20+ years shipping.

About

AI systems architect with a PhD in neural language understanding and 20+ years delivering production systems across software, silicon, and applied machine learning.

Itinitek builds real-time multimodal platforms, vision analytics, and domain-specific LLM systems that streamline complex human workflows. From Harvard's AI teaching simulations to healthcare and industrial automation, we clarify the problem, design the architecture, and close the execution gap quickly — with deterministic checks, not brittle automation.

Engines

Production systems and the architecture behind them. Each card is a look at the real work — vision pipelines, clinical extractors, and real-time orchestration engineered to run reliably under load.

Spatial-Semantic Pipeline

2D ISO drawings → 3D system model

A vision-LLM pipeline that ingests a stack of flat 2D isometric drawings and reconstructs them into a single validated 3D system model. Perception (vision/OCR) is separated from reasoning (LLM) so each stage can be tuned independently, with geometric consistency checks driving error rates down to levels acceptable for safety-critical work. You can then chat with your system in plain English: "find all butterfly valves in the 2-inch line." Weeks → hours.

  • Vision-LLM
  • OCR
  • Geometry Checks
  • Oil & Gas
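To make the idea of a geometric consistency check concrete, here is a minimal sketch, not the production pipeline. It assumes extracted pipe segments arrive as simple records with 3D endpoints and a nominal diameter; the `Segment` class, the `check_joint` and `check_run` helpers, and the tolerance value are all hypothetical names chosen for illustration.

```python
from __future__ import annotations

from dataclasses import dataclass
from math import dist


@dataclass(frozen=True)
class Segment:
    """Hypothetical record for one extracted pipe segment."""
    tag: str
    start: tuple[float, float, float]
    end: tuple[float, float, float]
    diameter_in: float


def check_joint(a: Segment, b: Segment, tol: float = 0.01) -> list[str]:
    """Return violations for the joint where segment a ends and b starts."""
    issues = []
    gap = dist(a.end, b.start)
    if gap > tol:
        issues.append(f"{a.tag}->{b.tag}: endpoints {gap:.3f} apart (tol {tol})")
    if a.diameter_in != b.diameter_in:
        issues.append(
            f"{a.tag}->{b.tag}: diameter {a.diameter_in} vs {b.diameter_in} with no reducer"
        )
    return issues


def check_run(segments: list[Segment], tol: float = 0.01) -> list[str]:
    """Check every consecutive pair in an ordered pipe run."""
    issues: list[str] = []
    for a, b in zip(segments, segments[1:]):
        issues.extend(check_joint(a, b, tol))
    return issues


# Illustrative data only: P3 starts 0.5 units away from P2 and changes size.
run = [
    Segment("P1", (0.0, 0.0, 0.0), (1.0, 0.0, 0.0), 2.0),
    Segment("P2", (1.0, 0.0, 0.0), (1.0, 2.0, 0.0), 2.0),
    Segment("P3", (1.0, 2.5, 0.0), (1.0, 4.0, 0.0), 3.0),
]
for issue in check_run(run):
    print(issue)
```

Because the checks are plain deterministic code, a failed joint can be routed back to the perception or reasoning stage that produced it instead of silently landing in the 3D model.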

Clinical Semantic Extractor

Freeform clinical text → structured ontologies

An LLM engine embedded directly into clinical workflows, with an efficient UI for occasional human-guided correction, that reads freeform patient notes and grounds every finding to standard ontologies (ICD-10, SNOMED, HPO, LOINC, RxNorm). Exact negation handling ("denies history of" → NEGATED) keeps it reliable in a regulated setting. High-quality output supports sophisticated downstream workflows.

  • ICD-10 · SNOMED
  • HPO · LOINC
  • Negation
  • RxNorm
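The negation step can be illustrated with a small rule-based sketch. This is a simplified trigger-phrase approach, not the LLM engine described above; the trigger list, the `Finding` record, and the `tag_findings` helper are assumptions made for the example, and ontology grounding is left out entirely.

```python
import re
from dataclasses import dataclass

# Hypothetical trigger phrases; a real system would need a far richer set.
NEGATION_TRIGGERS = (
    "denies history of",
    "denies",
    "no evidence of",
    "negative for",
    "without",
)


@dataclass
class Finding:
    text: str
    negated: bool


def tag_findings(sentence: str, vocabulary: list[str]) -> list[Finding]:
    """Mark each vocabulary term found in the sentence as negated or asserted."""
    lowered = sentence.lower()
    findings = []
    for term in vocabulary:
        match = re.search(rf"\b{re.escape(term.lower())}\b", lowered)
        if match is None:
            continue
        # Only text since the last scope breaker ("but" or ";") can negate the term.
        prefix = lowered[: match.start()]
        scope = re.split(r"\bbut\b|;", prefix)[-1]
        negated = any(
            re.search(rf"\b{re.escape(trigger)}\b", scope)
            for trigger in NEGATION_TRIGGERS
        )
        findings.append(Finding(term, negated))
    return findings


note = "Patient denies history of chest pain but reports dyspnea."
for finding in tag_findings(note, ["chest pain", "dyspnea"]):
    print(finding.text, "NEGATED" if finding.negated else "ASSERTED")
```

The scope breaker matters: without it, "dyspnea" would inherit the negation that applies only to "chest pain".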

Real-Time Multimodal Orchestration

One reasoning model driving four live avatars

A real-time meeting simulator in which a single LLM controlled four distinct video avatars at once, coordinating speech, timing, and behavior under tight latency over WebRTC, Next.js, and FastAPI. Speech-driven scenarios let students take an active part in the conversation. Delivered to 900+ MBA students; a closed-loop eval system stress-tested coordination with synthetic dialogue.

  • WebRTC
  • Multi-Agent
  • Low Latency
  • Harvard

Private-Subnet MCP Brokerage

Augment agents with secured MCP servers

A BAA-covered LLM agent sits outside the VPC; an MCP broker inside the private subnet routes each call to controlled MCP servers, augmenting the agent's capabilities within secure execution boundaries instead of granting open-ended access. Every MCP server runs inside the VPC and exposes a typed schema that allowlists exactly which operations it permits, so the agent gets approved, validated actions only, never direct access to the data or systems behind them.

  • MCP Servers
  • Pydantic
  • VPC Isolation
  • BAA / HIPAA
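The allowlist idea can be sketched in a few lines. This is an illustration of the pattern only: it does not use the MCP protocol or any MCP SDK, and it swaps in standard-library dataclasses where a Pydantic model (listed in the tags above) would typically do the validation, to keep the example dependency-free. The `Operation` and `Broker` classes and the `count_open_orders` handler are hypothetical.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Any, Callable


@dataclass(frozen=True)
class Operation:
    """One permitted operation: its name, typed parameters, and handler."""
    name: str
    params: dict[str, type]
    handler: Callable[..., Any]


class Broker:
    """Routes calls to allowlisted operations and rejects everything else."""

    def __init__(self, operations: list[Operation]) -> None:
        self._ops = {op.name: op for op in operations}

    def call(self, name: str, arguments: dict[str, Any]) -> Any:
        op = self._ops.get(name)
        if op is None:
            raise PermissionError(f"operation not allowlisted: {name}")
        if set(arguments) != set(op.params):
            raise ValueError(f"{name} expects exactly {sorted(op.params)}")
        for key, expected in op.params.items():
            if not isinstance(arguments[key], expected):
                raise TypeError(f"{name}.{key} must be {expected.__name__}")
        return op.handler(**arguments)


def count_open_orders(patient_id: str) -> int:
    """Stand-in for a query that runs inside the private subnet."""
    fake_store = {"p-001": 2}  # illustrative data only
    return fake_store.get(patient_id, 0)


broker = Broker(
    [Operation("count_open_orders", {"patient_id": str}, count_open_orders)]
)
print(broker.call("count_open_orders", {"patient_id": "p-001"}))
```

The agent only ever sees the return value of an approved handler; a request for anything not in the table fails before it reaches the systems behind the broker.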

Engagement Models

Three ways to work together as a fractional GenAI scientist & architect — from strategic oversight to dropping a proven engine into your cloud.

Strategic Advisory

Retainer-based

Architecture roadmapping, multi-model cost/latency routing analysis, vendor vetting, evaluation strategy, and security governance for agent action spaces. Async-first strategic oversight plus scheduled executive syncs.

Discuss advisory

Embedded Architect

Project-based

Direct ownership of engineering teams — designing planner-executor-verifier self-correcting loops and deterministic code-as-judge constraint checks. Active, project-bound engineering leadership from prototype to production.

Discuss a project

Deployment

Accelerator integration

Licensing, optimization, and deployment of a proven engine — the Spatial-Semantic or Clinical Extractor core — directly into your cloud environment. A focused 90-day execution sprint that bypasses standard R&D timelines.

Discuss deployment

Core Expertise

Agentic AI Systems

Multi-agent orchestration for production workflows, including planner-executor-verifier loops, MCP tool use, and stateful versus stateless reasoning strategies.

  • Multi-Agent
  • MCP
  • Verification Loops
  • Retry Logic
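A planner-executor-verifier loop with retry can be sketched as follows. This is one possible shape under stated assumptions, not a description of any delivered system: `run_with_verification` and the three stub functions are hypothetical, and the planner stub stands in for what would normally be an LLM call.

```python
from typing import Any, Callable, Optional


def run_with_verification(
    plan: Callable[[str, list], str],
    execute: Callable[[str], Any],
    verify: Callable[[Any], Optional[str]],
    task: str,
    max_attempts: int = 3,
) -> Any:
    """Plan, execute, and verify; feed verifier errors back into the next plan."""
    feedback: list = []
    for attempt in range(1, max_attempts + 1):
        step = plan(task, feedback)
        result = execute(step)
        error = verify(result)  # deterministic check: None means accepted
        if error is None:
            return result
        feedback.append(f"attempt {attempt}: {error}")
    raise RuntimeError(f"no verified result after {max_attempts} attempts: {feedback}")


def plan(task: str, feedback: list) -> str:
    # Stand-in for an LLM planner: it changes strategy once it sees feedback.
    return "add" if feedback else "concatenate"


def execute(step: str) -> Any:
    values = [1, 2, 3, 4]
    if step == "add":
        return sum(values)
    return "".join(str(v) for v in values)


def verify(result: Any) -> Optional[str]:
    # Code-as-judge: a plain type check instead of another model call.
    if isinstance(result, int):
        return None
    return f"expected int, got {type(result).__name__}"


answer = run_with_verification(plan, execute, verify, task="total the values")
print(answer)
```

Keeping the verifier as ordinary code means the loop's acceptance criterion is reproducible from run to run, and the attempt cap turns an unrecoverable failure into an explicit error instead of an endless retry.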

Evaluation & Reliability

Closed-loop evaluation systems, failure-mode analysis, and iterative tuning for AI products that need to work reliably under real constraints.

  • Dataset Construction
  • LLM-as-Judge
  • Guardrails
  • System Tuning

Generative AI & LLMs

RAG pipelines, fine-tuning, prompt engineering, and model routing for domain-specific systems that improve expert workflows instead of adding brittle automation.

  • RAG
  • MCP Servers
  • PEFT / LoRA
  • Prompt Engineering

Real-Time Multimodal

Low-latency voice, video, and avatar systems built for human interaction, teaching simulations, and other applications where timing and responsiveness matter.

  • WebRTC
  • Voice / Speech
  • Video Avatars
  • Low Latency

Computer Vision

Vision pipelines for industrial and edge workloads, combining detection, OCR, parsing, and multimodal reasoning to recover usable structure from complex visual inputs.

  • YOLO
  • OCR
  • Video Analytics
  • Multimodal Reasoning

Infrastructure & Delivery

End-to-end system design spanning model serving, cloud infrastructure, and application delivery across prototypes, pilots, and production systems.

  • AWS / Azure
  • PyTorch
  • TensorFlow
  • FastAPI / Next.js
  • Python / C++ / JS

Credentials

PhD Neural Language Understanding
19 US Patents
20+ Years Production Delivery

Education

PhD in Computer Science

University of Colorado, Boulder, CO (December 2017)

Dissertation: Natural Language Understanding: Deep Learning for Abstract Meaning Representation

This work used deep learning models similar to those that power ChatGPT and other modern LLMs.

MS Computer Science

University of Colorado, Boulder

BS Electrical Engineering

University of Colorado, Boulder

Selected Publications

Full history & patents

20+ years across AI, silicon, and systems

Before pioneering GenAI solutions, I spent years architecting mixed-signal integrated circuits and read channels for data storage. That foundation in low-level signal processing, error correction, and hardware constraints directly informs how I build efficient, scalable, robust AI systems today.

Jan 2026 - Mar 2026

Founder & Chief Consultant

Itinitek Ltd, Golden, CO

  • Designed an ingestion engine combining specialized AI agents with deterministic engineering checks so extracted data is verified against physical and geometric constraints before landing in the database.
  • Built workflows to route ambiguous cases for fast human review while preserving provenance and validation history on every extracted value.

Mar 2023 - Sept 2023

Founder & Chief Consultant

Itinitek Ltd, Golden, CO

  • Fine-tuned LLaMA for the legal domain in 3 weeks and built a RAG pipeline that handled 500+ page contract sets.
  • Built an AWS pipeline combining LLMs and SQL for real-time options analysis; the proof of concept helped drive a client funding round.

July 2020 - Feb 2023

Chief Scientist

Lilac Cloud Inc, Cupertino, CA

  • Architected vision-AI pipelines for edge computing, combining Dockerized GPU services with CUDA-accelerated model execution for real-time video analytics in constrained environments.
  • Built custom FFmpeg filters and Libav extensions to run inference inside the video pipeline, enabling object recognition and event detection without interrupting frame flow.
  • Implemented GAN-based imperceptible video watermarking and delivered low-latency frame processing for live sporting events.

May 2018 - June 2020

Cofounder & Chief Scientist

Bolt Analytics Corp, Santa Clara, CA

  • Architected time-series anomaly detection for financial services using CNN, RNN, and transformer approaches.
  • Led an offshore team building automated diabetic retinopathy detection from retinal scans using TensorFlow.

Aug 2016 - Apr 2018

Research Scientist

CU Computational Language and Education Research, Boulder, CO

  • Developed recurrent NLP models for automated speech recognition and dialog analysis in K-12 STEM education.
  • Built a system to help teachers reflect on and improve instructional practices through AI-powered feedback.

May 2012 - July 2014

Expert Technical Consultant

Dovel & Luner, LLP, Santa Monica, CA

  • Served as an expert witness in semiconductor patent litigation, contributing technical analysis that supported successful client outcomes.

May 2009 - May 2012

Founder & Independent Developer

Itinitek Ltd, Golden, CO

  • Architected, developed, and marketed six GPS, graphics, and skiing applications for iPhone (Objective-C), concurrent with MS and PhD coursework at the University of Colorado.

Nov 2003 - May 2009

Senior Director, Optical Products

Marvell Semiconductor Inc, Santa Clara, CA

  • Led a 150-engineer division developing full SoC solutions spanning ARM core, servo, read channel, and DSP on one chip.
  • Managed the complete product lifecycle from architecture through silicon to production across multiple generations of mixed-signal optical storage controllers.

U.S. Patents

8,559,287 Method and System for Fault Protection Using a Linear Feedback Shift Register
8,498,186 Circuits, Architectures, Apparatuses, Systems, Algorithms and Methods and Software for Optimum Power Calibration for Optical Disc Recording
8,391,115 Method of Improving Quality of Optical Recording Using Circumferentially Repeatable Compensation
8,345,523 Method and Apparatus for Optimizing Optical Recording
8,060,674 Systems and Methods for Data Storage Devices and Controllers
7,475,173 Integrated Disc Drive Controller
7,379,452 Synchronous Read Channel
6,594,716 Mixed-Signal Single-Chip Integrated System Electronics for Data Storage Devices

19 U.S. patents spanning signal processing, read channel design, mixed-signal integrated circuits, and optical recording systems. Full patent list available on request.

Let's architect your AI system.

Available for fractional GenAI leadership, architecture, and engine deployment.