OpenArx

Scientific knowledge infrastructure for AI agents

Search, explore, and connect research papers through MCP. Built for LLMs. Open for everyone.

Under Construction
2,937
papers indexed
167,592
semantic chunks
104
categories

Papers With Code — the only structured database of AI/ML research — was shut down in July 2025. 500,000 papers and 1,500 benchmarks gone overnight.

Meanwhile, LLM agents need reliable access to scientific knowledge. Current APIs are rate-limited, lack full-text access, and none offer MCP integration.

No scientific publisher has built an MCP server. We did.

First scientific MCP server

Connect your LLM to research papers in seconds. 4 tools: search, get_document, find_related, find_code. Hybrid search: BM25 + SPECTER2 + Gemini embeddings.

Knowledge, not documents

We don't just store PDFs. Each paper is decomposed into semantic chunks — individual ideas and claims — using LLM-powered analysis.

Open infrastructure

Open source. Open data. Open science. Apache 2.0 license. No paywalls. No vendor lock-in. A knowledge layer any AI application can build on.

Connect in seconds

Add to your MCP client configuration and start searching.

{ "mcpServers": { "openarx": { "url": "https://mcp.openarx.ai/mcp" } } }
ToolDescription
searchHybrid search across papers — combines keyword and semantic matching
get_documentRetrieve full paper details by arXiv ID
find_relatedDiscover similar papers via vector similarity
find_codeFind GitHub repos and datasets linked to a paper

What's next

IdeaRank — connecting scientific ideas across papers, not just citations
Self-publishing — publish your research, get indexed instantly
Public API — REST endpoints for developers
Growing the base — indexing towards full arXiv coverage