FOLD
Given a desired biochemical function, use RL to search the space of protein structures and generate one that satisfies it.
Category
AI/ML
Role
Lead AI Engineer
Timeline
6 Months
Tech Stack
FOLD
System Architecture
A high-level overview of the technical components and data flow that power FOLD.
The Approach
Architected a distributed multi-agent system using FastAPI for high-performance streaming and WebSockets for real-time agent-to-user communication. Implemented a sophisticated Agent Orchestrator that handles LLM routing and dynamic task planning via Directed Acyclic Graphs (DAGs). The system features a self-healing loop where agents evaluate and correct their own outputs. For memory, I integrated a RAG pipeline using pgvector and hierarchical compression to maintain long-term context without token overflow. The frontend provides a real-time dashboard for monitoring agent thought processes and task execution progress.
Key Challenges
- Managing state and consistency across multiple autonomous agents working on shared tasks.
- Reducing latency in RAG retrieval while maintaining high recall over millions of document chunks.
- Implementing a robust self-healing logic that prevents agents from entering infinite correction loops.
- Designing a streaming architecture that handles long-running background tasks without blocking the UI.