Shakvaro.
ScreenMind
Chrome extension · v0.3.0 · MIT licensed

Capture. Extract. Act.

Smart screenshot tool that captures, runs local OCR, classifies the content, and suggests the right action. Four capture modes, eight collections, zero servers.

100% on-device · No accounts · No telemetry · Works offline

Four ways to capture

Every screenshot Chrome can take. None it can't.

Visible tab

Snap what's on screen right now. One keystroke, one shot.

Area select

Drag a rectangle. Pixel-precise crop, snaps to your cursor.

Full page

Scroll-stitched capture of the entire page — header to footer.

Screen / window

Native desktop picker. Capture any app, any monitor.

What's inside

A screenshot tool that thinks for you

Local OCR via Tesseract WASM

Text extracted on-device in an offscreen document. Images are preprocessed and tiled vertically for tall captures. No cloud round-trip, works offline.
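To make the vertical tiling concrete, here is a minimal sketch of how a tall capture could be cut into overlapping bands before OCR. The function name, the `Tile` shape, and the default tile height and overlap are all assumptions for illustration; ScreenMind's actual values are not stated on this page.

```typescript
// Hypothetical description of one horizontal band of the full capture.
interface Tile {
  top: number;    // y offset of the band in the full capture, in pixels
  height: number; // band height in pixels
}

// Split an image of `imageHeight` pixels into bands of at most `tileHeight`
// pixels. Neighbouring bands share `overlap` pixels, so a text line shorter
// than the overlap that straddles a cut still appears whole in one band.
function planVerticalTiles(
  imageHeight: number,
  tileHeight: number = 2000, // assumed default, not taken from ScreenMind
  overlap: number = 100      // assumed default, not taken from ScreenMind
): Tile[] {
  if (imageHeight <= 0) return [];
  if (tileHeight <= overlap) {
    throw new Error("tileHeight must be larger than overlap");
  }
  const tiles: Tile[] = [];
  const step = tileHeight - overlap;
  for (let top = 0; top < imageHeight; top += step) {
    const height = Math.min(tileHeight, imageHeight - top);
    tiles.push({ top, height });
    if (top + height >= imageHeight) break; // this band reaches the bottom
  }
  return tiles;
}
```

Each band would then be cropped and passed to the OCR worker on its own, with the recognized text joined in top-to-bottom order.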

Auto content detection

Confidence-scored tagging into code, prices, contacts, tasks, UI, plain text, or mixed — so the right action shows up first.
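As a rough illustration of confidence-scored detection, the sketch below scores OCR text by the share of lines that match a regex for each content type. The patterns, the 0.3 cutoff, and the scoring rule are assumptions rather than ScreenMind's actual detectors, and the UI and mixed categories are left out to keep it short.

```typescript
type ContentType = "code" | "price" | "contact" | "task" | "plain";

interface Classification {
  type: ContentType;
  confidence: number; // share of lines supporting the label, 0..1
}

// Hypothetical detectors: one regex per content type (no `g` flag, so
// RegExp.prototype.test stays stateless between calls).
const DETECTORS: ReadonlyArray<{ type: ContentType; pattern: RegExp }> = [
  { type: "code", pattern: /[{};]|=>|\b(?:const|let|function|import|return)\b/ },
  { type: "price", pattern: /[$€£]\s?\d+(?:[.,]\d{2})?/ },
  { type: "contact", pattern: /[\w.+-]+@[\w-]+\.[\w.]+|\+?\d[\d\s().-]{7,}\d/ },
  { type: "task", pattern: /\b(?:todo|to-do|due|deadline|remind)\b|\[\s?\]/i },
];

function classify(ocrText: string): Classification {
  const lines = ocrText
    .split(/\r?\n/)
    .map((line) => line.trim())
    .filter((line) => line.length > 0);
  if (lines.length === 0) return { type: "plain", confidence: 0 };

  let best: Classification = { type: "plain", confidence: 0 };
  for (const { type, pattern } of DETECTORS) {
    const hits = lines.filter((line) => pattern.test(line)).length;
    const confidence = hits / lines.length;
    if (confidence > best.confidence) best = { type, confidence };
  }
  // Assumed cutoff: a weak signal falls back to plain text.
  return best.confidence >= 0.3
    ? best
    : { type: "plain", confidence: 1 - best.confidence };
}
```

A label with its score is what would let the interface sort suggested actions, and the same label can double as a search tag.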

Context-aware actions

Copy code, save lead, set reminder, create task — actions adapt to what you captured and your active mode.

Search OCR + metadata

Find any past screenshot by the words inside the image, the page title, the URL, or your tags.
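A search across those four fields can be sketched as a case-insensitive match where every query term must appear somewhere in the record. The `CaptureRecord` shape and the `searchCaptures` name are hypothetical; the real library may index its data differently.

```typescript
// Hypothetical stored record; field names are assumptions.
interface CaptureRecord {
  id: number;
  ocrText: string;   // words recognized inside the image
  pageTitle: string;
  url: string;
  tags: string[];
}

// Return the records in which every whitespace-separated query term appears
// in the OCR text, the page title, the URL, or one of the tags.
function searchCaptures(records: CaptureRecord[], query: string): CaptureRecord[] {
  const terms = query
    .toLowerCase()
    .split(/\s+/)
    .filter((term) => term.length > 0);
  if (terms.length === 0) return [];
  return records.filter((record) => {
    const haystack = [record.ocrText, record.pageTitle, record.url, ...record.tags]
      .join("\n")
      .toLowerCase();
    return terms.every((term) => haystack.includes(term));
  });
}
```

A linear scan like this is enough for a small library; a larger one would more likely rely on an index kept in IndexedDB.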

Collections out of the box

Code Snippets, UI Inspiration, Prices to Track, Leads, Tasks, Reading Notes, Job Opportunities, Personal Archive — all preseeded.

Side-panel library

Grid view, multi-select, type filters, viewer with action bar. Everything stored locally in IndexedDB.

Keyboard-first

⌘⇧S for visible tab, ⌘⇧A for area. Right-click anywhere for context-menu capture options.

Smart tagging

Regex detectors plus content-type heuristics auto-tag each capture — so search is instant, even months later.

Resume-on-startup

If a long OCR job is interrupted, ScreenMind picks it back up on next launch. No half-processed captures left behind.
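One way such a resume step could work is to keep a status on every OCR job and re-queue anything unfinished at launch. The sketch below shows only the selection logic; the `OcrJob` shape, the status names, and the retry cap are assumptions, not ScreenMind's actual schema.

```typescript
type OcrStatus = "pending" | "processing" | "done" | "failed";

// Hypothetical persisted job record.
interface OcrJob {
  captureId: number;
  status: OcrStatus;
  attempts: number; // how many times OCR has been started for this capture
}

// Pick the jobs that were queued or mid-flight when the browser closed and
// reset them to "pending". Jobs that have already used up `maxAttempts`
// are skipped so a capture that always crashes is not retried forever.
function jobsToResume(jobs: OcrJob[], maxAttempts: number = 3): OcrJob[] {
  return jobs
    .filter(
      (job) =>
        (job.status === "pending" || job.status === "processing") &&
        job.attempts < maxAttempts
    )
    .map((job): OcrJob => ({
      ...job,
      status: "pending",
      attempts: job.attempts + 1,
    }));
}
```

In an extension, a function like this would typically be called from a `chrome.runtime.onStartup` listener in the service worker, with the job list read from local storage.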

The pipeline

Capture → Extract → Act. Every time.

01

Capture

Pick a mode — visible tab, area, full page, or screen. Hotkey or right-click, your choice.

02

Extract

Tesseract runs in an offscreen document. Image preprocessed for contrast, tiled for tall pages, OCR'd locally.
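The preprocessing step can be sketched on a raw RGBA buffer (the layout of `ImageData.data`) as grayscale conversion, a min-max contrast stretch, and a fixed-threshold binarize. The function name, the luma weights, and the threshold of 128 are assumptions; the page does not say which variants of these steps ScreenMind uses.

```typescript
// Convert flat RGBA bytes to a black-and-white RGBA buffer of the same size.
function preprocessForOcr(
  rgba: Uint8ClampedArray,
  threshold: number = 128 // assumed cutoff, applied after the stretch
): Uint8ClampedArray {
  const pixelCount = Math.floor(rgba.length / 4);
  const gray = new Float64Array(pixelCount);
  let min = 255;
  let max = 0;

  // 1. Grayscale with Rec. 601 luma weights, tracking the darkest and
  //    brightest values seen.
  for (let p = 0; p < pixelCount; p++) {
    const i = p * 4;
    const g =
      0.299 * (rgba[i] ?? 0) +
      0.587 * (rgba[i + 1] ?? 0) +
      0.114 * (rgba[i + 2] ?? 0);
    gray[p] = g;
    if (g < min) min = g;
    if (g > max) max = g;
  }

  const range = max - min;
  const out = new Uint8ClampedArray(pixelCount * 4);
  for (let p = 0; p < pixelCount; p++) {
    const g = gray[p] ?? 0;
    // 2. Contrast stretch: map [min, max] onto [0, 255]. A flat image
    //    (range 0) is left unchanged.
    const stretched = range > 0 ? ((g - min) / range) * 255 : g;
    // 3. Binarize: pure white at or above the threshold, pure black below.
    const value = stretched >= threshold ? 255 : 0;
    const i = p * 4;
    out[i] = value;
    out[i + 1] = value;
    out[i + 2] = value;
    out[i + 3] = 255; // fully opaque
  }
  return out;
}
```

Stretching before thresholding means the fixed cutoff lands midway between the darkest and brightest pixels of each image, which helps with low-contrast text such as gray on dark gray.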

03

Act

Content type is detected. The right action — copy code, save lead, create task — surfaces first. Take it or file it.

Suggested actions

The right next step, already queued up

Snap a code block — "Copy code" is first. Snap a contact card — "Save lead" is first. Switch modes and the same captures get a different ranked menu. No menu-hunting.

  • Auto-detects code, prices, contacts, tasks, UI, plain text
  • Confidence-scored tagging powers search and filters
  • Mode-based re-ranking (Dev / Business / Shopper / Research)
  • Right-click context menu mirrors the side-panel actions
ScreenMind
Code · 94%
const ip = req.headers.get(
"x-vercel-forwarded-for"
)?.split(",")[0]?.trim();

Suggested actions

  • Copy code
  • Save to Code Snippets
  • Create task
  • Add tag

Modes

Tune ScreenMind to what you actually do

Dev

Boosts Copy Code and Create Task. Default mode for engineers.

Business

Boosts Save Lead and Create Task. Contact cards become leads in one click.

Shopper

Boosts Copy Text and Create Task. Price captures get tracked instantly.

Research

Boosts Copy Text and Add to Collection. Reading notes pile up clean.
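The boosts listed for each mode can be read as a stable re-ranking: boosted actions move to the front and everything else keeps its order. The sketch below takes the boost table from the descriptions above; the type names, the `rankActions` function, and the idea that the incoming order comes from content detection are assumptions.

```typescript
type Mode = "dev" | "business" | "shopper" | "research";
type Action =
  | "Copy code"
  | "Copy text"
  | "Save lead"
  | "Create task"
  | "Add to collection"
  | "Add tag";

// Boosted actions per mode, as described on this page.
const MODE_BOOSTS: Record<Mode, Action[]> = {
  dev: ["Copy code", "Create task"],
  business: ["Save lead", "Create task"],
  shopper: ["Copy text", "Create task"],
  research: ["Copy text", "Add to collection"],
};

// Move the active mode's boosted actions to the front. Relative order is
// preserved inside both the boosted and the non-boosted group.
function rankActions(actions: Action[], mode: Mode): Action[] {
  const boosted = MODE_BOOSTS[mode];
  return [
    ...actions.filter((action) => boosted.includes(action)),
    ...actions.filter((action) => !boosted.includes(action)),
  ];
}
```

With this rule, the same capture yields "Save lead" first in Business mode and "Copy text" first in Research mode, without changing which actions are offered.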

Your screenshots never leave your machine.

OCR runs locally via Tesseract WASM in an offscreen document. The extension makes zero outbound requests — verified by grep across the source. No analytics, no auth, no remote config. Captures live in your browser's IndexedDB and are deletable from the side panel in two clicks.

No telemetry
No accounts
MIT, open source

Keyboard

Two keys, two captures

Capture visible tab
⌘⇧S
Capture area
⌘⇧A

Or right-click → ScreenMind for the context menu

Questions

The honest answers

Are my screenshots uploaded anywhere?

No. OCR runs locally via Tesseract WASM inside an offscreen document. There are no fetch calls to external hosts in the extension, only chrome.runtime.getURL for bundled assets. No telemetry, no analytics, no remote config. Audit the source to verify.

Where are my captures stored?

In your browser's IndexedDB via Dexie. Image blobs, thumbnails, OCR text, tags, and collections all stay on your machine. unlimitedStorage is requested because screenshots are bulky. Clear the library from the side panel any time.

How accurate is the OCR?

Tesseract.js v7 with English. Preprocessing (grayscale, contrast, binarize) plus vertical tiling lifts accuracy on long pages and small UI fonts. It is not cloud OCR, so accuracy drops on stylized fonts and rotated text. For most code, prices, and contact info it's solid.

What do modes change?

Modes re-rank the suggested actions. Dev surfaces 'Copy Code' first, Business surfaces 'Save Lead', Shopper surfaces price actions, Research surfaces collection adds. Same captures, different first action.

Does it work on every site?

Yes. host_permissions: <all_urls> means capture works on any page you visit. Chrome's own internal pages (chrome://) are the exception: Chrome restricts them for every extension, and <all_urls> does not cover them.

What does it cost?

Free. MIT licensed. Source on GitHub. No accounts, no paid tier.

ScreenMind

Free · MIT · v0.3.0

Stop hunting through screenshots. Start using them.

One click from the Chrome Web Store. No account, no setup, no upsell.