Skip to content

ScrapalotYour Documents, Supercharged

Stop searching. Start asking. Turn any document into an intelligent conversation partner.

The Problem We're Solving

Research is drowning in documents. Literature reviews take weeks. Finding connections between papers is manual detective work. Your team's collective knowledge stays siloed in individual folders.

The real challenge isn't finding information—it's understanding it. You need to:

  • Synthesize insights across dozens or hundreds of papers
  • Track citations and relationships between documents
  • Collaborate with teammates on shared research
  • Answer complex questions that span multiple sources

Scrapalot transforms document chaos into structured knowledge. It's not just search—it's a research intelligence platform.

Who Uses Scrapalot?

Researchers & Scientists

"I uploaded 50 research papers and asked it to compare methodologies across studies. Got a comprehensive answer in seconds with exact citations."

Perfect for literature reviews, hypothesis generation, grant writing, and discovering hidden connections across research domains. Supports academic connectors (Google Scholar, Semantic Scholar, arXiv) for direct paper import.

Academic Institutions

"Our PhD students complete literature reviews in days instead of months. The knowledge graph reveals relationships between papers we never noticed."

Accelerate research workflows, enable cross-departmental collaboration, and build institutional knowledge bases that persist beyond individual projects.

R&D Teams

"We centralized 5 years of internal research reports. Now our scientists can query across all past experiments instantly."

Turn scattered research into searchable, queryable knowledge. Find prior art, avoid duplicating experiments, and build on past discoveries.

Engineering Teams

"Our technical documentation finally makes sense. New engineers onboard faster, and senior devs answer questions without context-switching."

Knowledge sharing without the bottleneck. Your documentation becomes a conversational partner, not a maze of markdown files.

How It Actually Works

Step 1: Upload & Connect Drag and drop PDFs, EPUBs, Word docs, or connect Google Drive, Dropbox, Notion. Search academic databases directly (Google Scholar, Semantic Scholar, arXiv). We'll read everything—even scanned documents and images.

Step 2: Ask Anything
"What were the main findings in the Q3 reports?"
"Compare the pricing models mentioned in these contracts."
"Summarize all the security requirements across our documentation."

Step 3: Get Smart Answers
Not just keyword matches—actual understanding. With citations, so you can verify everything.

The Secret Sauce

We use more than 10 perfectly crafted intelligent search strategies, and let an AI agent automatically select the optimal approach for your question.

Three-Way Search Fusion:

  • Semantic understanding (meaning-based search)
  • Keyword matching (exact phrase finding)
  • Relationship mapping (connection discovery)

Smart Context System:

  • Hierarchical document structure (Workspace → Collection → Book → Chapter → Section)
  • Intelligent context expansion from snippets to full sections
  • 70% reduction in incorrect answers vs. traditional systems

Advanced Processing:

  • 17+ document processing strategies for optimal accuracy
  • Intelligent chunking that preserves meaning across documents

Knowledge Graph Enhancement:

  • Automatic entity and relationship extraction
  • Cross-document connection discovery
  • 25% better answers for entity questions, 40% for relationship queries

You don't need to understand the internals—it just works. But for the curious, we've documented our architecture in detail.

Pricing That Makes Sense

Researcher Plan - Free

Perfect for individual researchers and students.

Desktop App (Fully Open Source)

  • Download and run locally
  • Full RAG capabilities
  • All agentic strategies and orchestrators
  • Use any AI model (OpenAI, local, etc.)
  • Your data stays on your machine

Cloud Version (Free Tier)

  • Managed hosting - no setup required
  • Single-user workspace
  • Academic connectors (Google Scholar, arXiv, Semantic Scholar)
  • Generous storage and usage limits

Get Started →

Professional Plan - $19/month

For research teams and small organizations.

  • Up to 5 team members
  • Shared workspaces and collections
  • Team collaboration features
  • Priority support
  • Cloud storage included

Enterprise Plan - Custom Pricing

For large institutions and organizations.

  • On-premise deployment to your cloud infrastructure
  • Unlimited users and workspaces
  • SSO integration (SAML, LDAP)
  • Dedicated support and SLAs
  • Custom features and integrations
  • Professional services available

Transparent Pricing Model

Researcher Plan: Fully open source desktop app. Free cloud tier for individual academics.

Professional/Enterprise: Proprietary team collaboration and multi-user management code. Deploy to your own infrastructure or use our managed cloud.

Your research data is always yours - export and migrate anytime.

What Makes This Different?

Research-Grade Intelligence

Most tools do keyword matching. Scrapalot uses agentic RAG with 13 strategies, tri-modal fusion (semantic + lexical + graph), and knowledge graphs. Ask "What methodologies do these papers share?" and get synthesis across sources with exact citations.

Built by Researchers, For Researchers

We built this because literature reviews were painful. Spending weeks reading papers, manually tracking citations, losing insights in scattered notes. If it frustrated us, we fixed it. The result: a research intelligence platform, not just document search.

Open Source Desktop App

The desktop version is fully open source (MIT licensed). All RAG strategies, chunking methods, and core features are available. Fork it, modify it, use it however you want. We monetize through managed cloud hosting and enterprise team features, not by restricting research capabilities.

Technical Depth Without the Complexity

Under the hood: Advanced search algorithms, intelligent routing, three-way fusion, context expansion, knowledge graphs, and 17+ processing strategies. On the surface: a simple chat interface. You get research-grade AI without needing a technical background.

Next Steps

Just Want to Try It?
Quick Start → - 10 minutes to your first answer

Want to Understand How It Works?
Architecture Deep Dive → - For the technically curious

Need Help?
FAQ → - Common questions
GitHub Discussions - Ask the community
Email Us - We actually respond


Star on GitHub · Join Discord · Read the Blog

Released under the MIT License.