Skip to content
View joshuaFordyce's full-sized avatar

Block or report joshuaFordyce

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
joshuaFordyce/README.md

Applied AI & MLOps | Distributed Data Systems | Reducing Developer Friction

Hi I'm Joshua. I architect and build the "glue code" that connections cutting edge llm and RAG tools with high-scale data infrastucture. My core focus is reducing developer friction by creating robust, observable, and performant extensions for modern MLOps frameworks

I'm passionate about building the custom solutions and integrations that bring advanced AI out of the lab and into production for user-focused use cases.

1. Data Infrastucture Starburst Contributions:

  • Starburst Contributiosn & Field Enablement:
    • My work centers on developer and operator efficiency.
      • Contributions to Minitrino (lightweight testing envirionments)
      • Comprehensive Trino Runbooks for advanced debugging
      • The Terraform toolkit for declarative data infrastucture deployment

Expertise:

  • This experiences has made me an expert in deploying, optimizing and troubleshooting complex Distributed Systems and Kubernetes/Container infrastucture for petabyte-scale data access.

2. AI Integrations & Extension (Custom Solutions):

  • I'v been applying my infrastructure knowledge to the open source AI landscape, specifically by customizing and extending foundational tools to handle enterprise data complexity
    • RAG & LLM Tooling:
      • I build direct contributions and glue-code customizations to RAG libraries like LangChain and LlamaINdex, ensuring they can robustly connect to and query structured data sources like Trino
    • MLOps:
      • I create specialized extensions on modern MLOps frameworks like Truss and Opik to solve production challenges like structured output, observability, and efficient model serving

3. Kubernetes Based Trino Deployment Guides

  • I'v been applying my Kubernetes experties to adding Deployment guides for deploying performant Trino Clusters on Kubernetes

Highlighted Open Source Contributions with Pinned Repos

These are currently in-progress or recently completed projects that demonstrate my focus on field-ready AI solutions:

Lets Connect

I am actively seeking roles like forward deployed engineer, field deployment engineer and Post Sales Solution Architect where I can directly bridge the gap between cutting-edge AI research and real-world customer challenges

Pinned Loading

  1. minitrino minitrino Public

    Forked from jefflester/minitrino

    A tool that makes it easy to run modular Trino environments locally.

    Python

  2. Semantrino Semantrino Public

    Python

  3. llama_index llama_index Public

    Forked from run-llama/llama_index

    LlamaIndex is the leading framework for building LLM-powered agents over your data.

    Python

  4. keda keda Public

    Forked from kedacore/keda

    KEDA is a Kubernetes-based Event Driven Autoscaling component. It provides event driven scale for any container running in Kubernetes

    Go

  5. trino-gateway-api trino-gateway-api Public

  6. trino-k8s-cilium-perf trino-k8s-cilium-perf Public

    Shell