Skip to content
View ibrahim1023's full-sized avatar
💻
Working from home
💻
Working from home

Block or report ibrahim1023

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ibrahim1023/README.md

Ibrahim Arshad

Building deterministic AI systems, not demos.


About

Software engineer focused on agentic AI systems.

I build:

  • multi-agent systems with structured orchestration
  • deterministic harnesses for reliability
  • evaluation-aware AI systems (focused on correctness, not just output)

Focus areas:

  • agent orchestration (LangChain, LangGraph, custom harnesses)
  • context + tool design (ACI-style systems)
  • AI evaluation (failure detection, reproducibility, ranking)

Currently working across

→ AI systems (LLM workflows, agent orchestration, evaluation)
→ Distributed systems + automation tooling


Featured Project: ci-rootcause

Deterministic multi-agent CI debugging engine.

Most AI CI tools summarize logs.
They fail because CI failures are execution systems, not text problems.

ci-rootcause reconstructs execution and identifies the actual root cause.

What it does

  • builds a failure graph from CI logs
  • detects the first failure (not downstream symptoms)
  • analyzes diffs to link code changes to breakages
  • ranks root causes using deterministic scoring
  • generates evidence-backed fixes (LLM-constrained)
  • produces structured outputs: ci-rca.json, ci-rca.md

LLM Support (Pluggable)

LLMs are used selectively for:

  • explanation
  • fix suggestions

Supported providers:

  • Ollama (local models)
  • OpenAI
  • Anthropic
  • Google Gemini

LLMs are never used for scoring or confidence.

Why it matters

  • no hallucinated root causes
  • reproducible outputs across runs
  • confidence is computed, not generated
  • works with both local and hosted models
  • designed for real CI workflows, not demos

Core Principles

  • determinism over heuristics
  • systems over prompts
  • evaluation before optimization
  • evidence over plausibility

Output

  • ci-rca.json → machine-readable root cause
  • ci-rca.md → human-readable explanation

Open Source Focus (2026)

  • contributing to real AI systems (not toy projects)
  • building production-grade agent workflows
  • focusing on correctness, determinism, and evaluation
  • shipping systems that can be reasoned about and verified

Approach: → consistent, high-frequency contributions → focus on real issues that get merged

PRs

  • [Open] #6535 fix(ethereum): handle trace_filter traces missing result.output via c… in graphprotocol/graph-node
  • [Open] #2331 fix(langgraph): handle null thread checkpoint in RemoteGraph.getState in langchain-ai/langgraphjs
  • [Open] #5461 fix(converter): fall back on invalid JSON-like partial matches in crewAIInc/crewAI
  • [Open] #5545 fix(flow,task): handle pydantic outputs in guardrail retries and checkpoint serialization in crewAIInc/crewAI
  • [Open] #2316 fix(sdk): Backfill truncated history for regenerate branching in langchain-ai/langgraphjs
  • [Open] #21386 fix(azureaisearch): preserve falsy metadata values in index mapping in run-llama/llama_index
  • [Open] #21336 fix(elasticsearch): split sync and async store paths in run-llama/llama_index

Engineering Principles

  • Determinism over heuristics where possible
  • Systems over prompts
  • Evaluation before optimization
  • Evidence > plausibility

Skills

AI Systems

  • LangChain
  • LangGraph
  • multi-agent orchestration
  • LLM tool + context design

Infrastructure / Systems

  • Python
  • TypeScript
  • Rust

Focus Areas

  • agent harness design
  • evaluation systems
  • reproducibility + reliability

Contact

Pinned Loading

  1. chain-autoresearch-mlx chain-autoresearch-mlx Public

    Forked from trevin-creator/autoresearch-mlx

    Apple Silicon (MLX) port of Karpathy's autoresearch — autonomous AI research loops on Mac, no PyTorch required.

    Solidity

  2. ci-rootcause ci-rootcause Public

    Python 1