Biomedical research moves at the speed of data. But for most research teams, data doesn’t move — it sits scattered across APIs, proprietary spreadsheets, clinical trial databases, and public repositories, waiting for someone to manually pull it together. That someone is usually a researcher. And the hours they spend gathering and cross-referencing information are hours not spent on the scientific work only they can do.
The Biomedical Research Institute needed that bottleneck eliminated. They came to Aegasis Labs with a clear brief: build an intelligent research assistant capable of autonomously handling the entire data gathering and synthesis workflow — across multiple structured and unstructured sources — and delivering cited, human-readable research outputs that scientists could act on immediately.
We built CalibrBV: a modular, multi-agent AI research platform that decomposes complex biomedical queries, retrieves data from live APIs, proprietary spreadsheets, and public research sources simultaneously, synthesizes findings across all of them, and returns contextual, citation-backed reports in a conversational interface. The result was a reduction in manual data collection and synthesis time of over 70% — and a research environment where scientists spend their time on interpretation and innovation, not information retrieval.
The Biomedical Research Institute operates at the intersection of data-intensive science and high-stakes decision-making. Their researchers work across drug development, clinical trials, company analysis, and regulatory filings — domains where the quality and speed of evidence gathering directly affects the quality and speed of scientific outcomes.
Their data environment reflects the complexity of modern biomedical research. Relevant information lives across GlobalData APIs covering drugs, trials, deals, companies, and filings; proprietary Citeline spreadsheets holding structured clinical and market data; and the broader public research landscape accessible through open-source tools and literature databases. Each source has different formats, different access methods, and different levels of structure.
Getting a complete picture of any research question required navigating all of them — manually, sequentially, and with significant time investment before any actual analysis could begin. The institute needed a platform that could do that work autonomously, accurately, and at speed. CalibrBV was the answer.
Biomedical research has a data problem that doesn’t get talked about enough — not the problem of too little data, but the problem of too much friction in accessing it.
A researcher pursuing a hypothesis about a drug compound’s clinical trial landscape might need to query GlobalData for active trials, cross-reference Citeline spreadsheet data on related compounds, scan recent literature for relevant findings, and pull company filing information to understand the commercial context. Each of those steps requires a different tool, a different interface, and a different mental context switch. Done manually, a thorough research synthesis on a single query can consume a significant portion of a working day — before any actual scientific thinking has begun.
This isn’t an edge case. It’s the routine experience of research teams operating in data-rich, multi-source environments. And the consequences compound at scale. Hypothesis validation slows down. Decision-making cycles lengthen. Researchers with specialized scientific expertise spend meaningful time on tasks that are fundamentally mechanical — data retrieval, format normalization, cross-source reconciliation — rather than on the interpretation and innovation they were hired for.
There’s also an accuracy dimension. Manual synthesis across multiple sources introduces inconsistency. A researcher pulling data from three different platforms over the course of an afternoon may be working with information at different points in time, formatted differently, with no automatic reconciliation. Citations get lost. Context gets dropped. The resulting synthesis reflects the limitations of the process as much as the quality of the underlying data.
The institute needed a system that could handle the entire retrieval and synthesis workflow — autonomously, across all relevant sources, simultaneously — and return outputs that were not just fast but traceable. Researchers needed to trust the outputs, which meant every insight had to come with its source. A system that produced confident-sounding summaries without clear attribution would be worse than useless in a scientific context.
The technical challenge was substantial. Building an agent that could decompose an open-ended biomedical research prompt into discrete retrieval tasks, route those tasks to the right data sources, maintain reasoning consistency across a multi-step workflow, and synthesize coherent findings across structured APIs, spreadsheet data, and unstructured web content — all while managing session state and returning results through a clean research interface — required architectural sophistication well beyond a simple API wrapper.
Aegasis Labs designed and built CalibrBV as a modular, multi-agent AI system — architected specifically for the complexity of multi-source biomedical research. Every component was designed around two requirements that don’t always coexist comfortably: speed and traceability. Fast results that researchers can’t verify are a liability. Traceable results that take too long to generate don’t get used. CalibrBV delivers both.
Supervisor Agent — Orchestrating the Research Workflow
At the top of the architecture sits a Supervisor Agent responsible for decomposing complex research prompts into granular, routable tasks. When a researcher submits a query — anything from a targeted drug pipeline question to a broad competitive landscape analysis — the Supervisor breaks it into discrete sub-tasks, determines which data sources are relevant to each, and routes them to the appropriate agents in parallel.
This orchestration layer maintains stateful, multi-step reasoning consistency across the entire workflow. Tasks don’t execute in isolation; the Supervisor ensures that intermediate findings inform downstream retrieval and that the final synthesis reflects the full research picture, not just the last query result.
Custom Research Agent — Multi-Source Retrieval and Synthesis
The Research Agent handles the actual data work. It connects to GlobalData APIs — covering drugs, clinical trials, deals, companies, and filings — pulls structured insights from Citeline spreadsheets using Pandas-based parsing, and accesses public research sources through a GPT Researcher-powered layer for open literature and web data.
Critically, the agent doesn’t just retrieve data — it classifies task intent, retrieves relevant content, and synthesizes results with contextual references before returning output to the Supervisor. Model Context Protocol (MCP) is implemented throughout to enrich research context and ensure data traceability. Every finding comes with its source. Researchers know not just what the system found, but where it came from.
The underlying reasoning layer uses a React-based design — reasoning and action chaining — allowing the agent to make intermediate inferences, adjust retrieval based on partial findings, and handle the kind of multi-step reasoning that complex biomedical queries require.
FastAPI Backend — Orchestration Hub
The backend is built on FastAPI, serving as the central orchestration hub managing user sessions, API requests, and research flows end to end. Endpoints handle task submission, result retrieval, and structured research summaries. Redis manages session state and task caching, maintaining conversational context across multi-step research flows and improving responsiveness for iterative queries.
The Spreadsheet Agent — Citeline Data Layer Handles the proprietary, internally maintained Citeline datasets that GlobalData doesn’t cover. Using Pandas-based parsing, this agent extracts structured insights from CSV and XLSX files, normalizes the output into a format consistent with other agent outputs, and feeds it into the synthesis layer. The Citeline data is often where the most institution-specific insight lives — this agent makes it machine-readable and query-responsive.
The Web Research Agent — Open-Source Intelligence For queries that extend beyond proprietary datasets into the broader scientific literature and public biomedical resources, this agent deploys a GPT Researcher-powered layer for open-source data exploration. It retrieves, reads, and summarizes relevant public findings — adding the breadth of published research to the depth of structured proprietary data.
The Synthesis Agent — Multi-Source Report Assembly The output layer. Once the Supervisor confirms that all subtask results are complete, the Synthesis Agent aggregates findings from all active research agents, resolves conflicts or gaps in the data, and generates a structured, human-readable research summary. Reports include contextual references, citation-level traceability for each claim, and scientific language calibrated to the domain. This is the agent that turns raw multi-source data into something a researcher can act on immediately.
GPT-4 for Biomedical Reasoning and Generation
The ML layer is powered by OpenAI GPT-4, integrated with custom prompting pipelines tailored for biomedical reasoning, language generation, and synthesis. The system doesn’t apply a general-purpose language model to a specialized domain without adaptation — the prompting architecture was designed specifically for scientific context, ensuring that summaries are accurate, appropriately hedged, and grounded in the retrieved source material rather than model priors.
Session and State Management — Redis-Backed Continuity Manages conversational state across multi-step research flows. Researchers can iterate on a query, drill deeper into a finding, or pick up a research thread across sessions — without losing context. Redis caching also improves platform responsiveness under concurrent research loads, keeping the experience fluid even when the underlying orchestration is processing multiple parallel subtasks.
The platform is a production research environment, engineered for real scientific workloads.
Technologies
The CalibrBV engagement followed Aegasis Labs’ four-step delivery approach, designed to translate a complex technical brief into a production system without losing scientific precision along the way:
CalibrBV launched as a production platform serving active biomedical research teams. The outcomes were concrete.
A conventional AI assistant connected to a few APIs might have automated part of this workflow. The retrieval part, perhaps. Or the summarization. Not both, reliably, across multiple heterogeneous data sources, with source traceability at the claim level.
The multi-agent architecture solved the problem that a single-model approach couldn’t: the data landscape CalibrBV needed to navigate isn’t uniform. GlobalData APIs, Citeline spreadsheets, and public web sources each require different retrieval logic, different parsing strategies, and different quality standards for what counts as a usable output. Specialized agents handle each source on its own terms — then the Supervisor and Synthesis Agent bring those outputs into coherence.
That decomposition is also what makes the platform trustworthy. When each agent owns a discrete, testable scope, the system’s behavior is auditable at every stage. When a report’s claim is questioned, the trace runs back through synthesis, through retrieval, to the source. That chain of accountability is built into the architecture — not reconstructed after the fact.
CalibrBV is the kind of problem Aegasis Labs is built for: a domain with deep expertise requirements, a data landscape that defies simple automation, and end users — researchers — whose time is too valuable to spend on information retrieval.
The multi-agent architecture didn’t just automate the workflow. It made the workflow trustworthy, scalable, and genuinely useful for scientists doing serious work.
Aegasis Labs has delivered 70+ projects across 8+ industries — including healthcare, biotechnology, and enterprise SaaS — with a 97% project completion success rate and 95% client satisfaction. Our 30+ consultants bring top 1% technical expertise across AI architecture, multi-agent systems, and cloud engineering. We’re a certified partner of both AWS and Microsoft.
If your research or operations team is spending too much time gathering data and not enough time acting on it — we’d like to show you what intelligent automation can do.