Full-Text Ingestion
Input Priority
Section titled “Input Priority”ingest_paper_fulltext accepts any of:
doipaper_urlpdf_urllocal_pdf_path
You must pass at least one.
Parse Modes
Section titled “Parse Modes”auto(default): tries parser fallback chaingrobid: force GROBID pathsimple: force basic parsing fallback
In auto mode, ScholarMCP now attempts grobid -> simple.
Common Failure Cases
Section titled “Common Failure Cases”- DOI page has no downloadable PDF URL:
- retry with
pdf_urlorlocal_pdf_path
- retry with
- Remote downloads disabled:
- set
RESEARCH_ALLOW_REMOTE_PDFS=true
- set
- Local ingestion disabled:
- set
RESEARCH_ALLOW_LOCAL_PDFS=true
- set
- Throttling or timeout pressure:
- increase
SCHOLAR_REQUEST_DELAY_MSand/orRESEARCH_TIMEOUT_MS
- increase