Work with a user's Semantic Scholar account to read Research Feeds, inspect private Library folders, add papers to folders, and resolve Semantic Scholar paper records from identifiers such as arXiv IDs.
Add this skill
npx mdskills install zjsxply/semantic-scholar-library-feedClear CLI-focused workflow for authenticated Semantic Scholar operations with good cookie management patterns
1---2name: semantic-scholar-library-feed3description: Work with a user's Semantic Scholar account to read Research Feeds, inspect private Library folders, add papers to folders, and resolve Semantic Scholar paper records from identifiers such as arXiv IDs.4---56# Semantic Scholar Library & Feed78## Overview910Use this skill to work against Semantic Scholar's authenticated Library and Research Feed surfaces without browser-driven login flows. Prefer the bundled CLI for Cookie Store inspection, browser-curl import, SSR probing, feed pagination, folder inspection, folder writes, and Graph `paper/batch` lookups.1112## Quick Start13141. Check whether the fixed Cookie Store already exists:1516```bash17python3 scripts/semantic_scholar_cli.py cookie-summary18```19202. If the Cookie Store is missing or stale, ask the user to copy an authenticated Semantic Scholar request as curl from browser DevTools, then import it:2122```bash23python3 scripts/semantic_scholar_cli.py import-curl \24 --curl-file /tmp/semantic-scholar-request.sh25```2627`import-header` is still available when the user already extracted only the raw Cookie header.28293. Check cookie health:3031```bash32python3 scripts/semantic_scholar_cli.py cookie-summary33```34354. Choose the task module below.3637## Cookie Store3839Save Semantic Scholar auth state under:4041- `~/.auth/semantic-scholar.cookies.json`42- `~/.auth/semantic-scholar.cookie-header.txt`4344Treat `sid` and `s2` as the minimum required cookies for private Library and Feed access. If either is missing, ask the user for a fresh browser-copied curl and re-import it before touching private endpoints.4546Read [references/auth-and-cookies.md](references/auth-and-cookies.md) when you need the Cookie Store workflow or the curl import format.4748## Task Modules4950### Research Feed5152Use this path when the user wants feed export, history crawl, or local coarse filtering.53541. Probe SSR if you need to inspect `var DATA`:5556```bash57python3 scripts/semantic_scholar_cli.py ssr-dump --list-names58```59602. Crawl the feed through `/api/1/library/folders/recommendations`:6162```bash63python3 scripts/semantic_scholar_cli.py feed-crawl \64 --output /tmp/research-feed.json65```66673. Persist output after every window. Do not wait for the entire crawl to finish.6869Read [references/research-feed.md](references/research-feed.md) when you need the SSR decode order, the real API path, or the pagination stop rules.7071### Library Folder7273Use this path when the user wants folder contents, folder diffs, or bulk add operations.74751. Export a folder:7677```bash78python3 scripts/semantic_scholar_cli.py folder-entries \79 --folder-id 13895811 \80 --all-pages \81 --output /tmp/folder.json82```83842. Add a paper to a folder:8586```bash87python3 scripts/semantic_scholar_cli.py folder-add \88 --paper-id 25f612200a3821c71b99819cd671f2e60df5b470 \89 --paper-title 'AgentArk: Distilling Multi-Agent Intelligence into a Single LLM Agent' \90 --folder-ids 1389581191```9293Read [references/library.md](references/library.md) when you need endpoint behavior, `pageSize` limits, or the verified `entries/bulk` request shape.9495### Paper ID Resolution9697Use Graph `paper/batch` when BibTeX already contains stable identifiers such as arXiv IDs.9899```bash100python3 scripts/semantic_scholar_cli.py graph-batch \101 --ids ARXIV:2602.08234,ARXIV:2602.12670102```103104Prefer this over search-page scraping when possible.105106## Operating Rules107108- Prefer direct HTTP after cookies exist.109- Do not use Playwright for login. If the Cookie Store is missing, ask the user to copy an authenticated browser request as curl and import it.110- Use Playwright only when rendered-page behavior or network inspection is needed to discover hidden interfaces.111- Treat browser clicking as a reconnaissance step, not the main extraction path.112- For feed history, stop on `empty days`, missing `nextWindowUTC`, or repeated windows.113- For folder sync, resolve `paperId` first, diff against existing folder entries, then call `folder-add`.114- If the task depends on a private page and returns `401`, ask for a fresh browser-copied curl and refresh the Cookie Store before debugging the endpoint.115
Full transparency — inspect the skill content before installing.