Notes on Markdown, LLM context, and browser tooling.

13 min readNotionMarkdownRead

Importing Markdown into Notion Without Losing Formatting

A practical guide to importing Markdown into Notion: what paste converts to blocks, what the file importer keeps, and how to avoid the lossy parts.

ComparisonsJun 2, 2026

Markdown vs JSON vs Text for LLM Context

When to feed an LLM Markdown vs JSON vs plain text: token density, reasoning reliability, and the rule for picking a format per content type.

14 min readLLM contextMarkdownRead

13 min readObsidianMarkdownRead

Obsidian Frontmatter for Web Clipping, Done Right

Clip web pages into Obsidian with clean YAML frontmatter and Properties: which fields to keep, the obsidian://new URI scheme, and Dataview-friendly metadata.

LLM ContextJun 2, 2026

Packaging a Web Corpus for AI Agents to Ingest

Turn clipped pages into an agent-ready web corpus — a folder of Markdown plus an index and a manifest.json with file, title, URL, and token counts.

15 min readLLM contextRAGRead

14 min readLLM contextMarkdownRead

Send a Web Page to ChatGPT, Claude, Perplexity

How to send a web page to ChatGPT, Claude, and Perplexity as clean Markdown context instead of a raw URL — paste vs link, per-tool quirks, and a one-click flow.

12 min readLLM contextMarkdownRead

YouTube Transcripts to LLM Context: A Clean Markdown Flow

Turn YouTube transcripts into clean, low-token Markdown LLM context: strip timestamps and filler, then feed Claude or ChatGPT a citable source.

WorkflowMay 26, 2026

Building a Personal RAG with BulkMD Markdown Output

A reproducible 200-line personal RAG pipeline — capture with BulkMD, chunk on Markdown headings, embed with OpenAI, retrieve with LanceDB, answer with Claude.

11 min readRAGMarkdownRead

WorkflowMay 26, 2026

Building an Obsidian Knowledge Base from Web Pages

A reproducible workflow for turning your read-it-later list into a structured Obsidian vault — frontmatter, folder shape, and linking patterns.

11 min readObsidianNotionRead

11 min readManifest V3Service workerRead

chrome.storage Patterns for Manifest V3 Extensions

When to use chrome.storage.session, .local, .sync, or IndexedDB in a Manifest V3 extension — quotas, throughput, and a practical layout for queue-heavy work.

WorkflowMay 26, 2026

Building a Claude Code Knowledge Base from Web Docs

A reproducible workflow for turning any documentation site into a local Markdown knowledge base that Claude Code, Cursor, and other coding agents can index.

11 min readClaudeChatGPTRead

Cost & PerformanceMay 26, 2026

Claude Model Routing: Haiku vs Sonnet vs Opus for RAG

When to call Haiku, Sonnet, or Opus in a RAG pipeline — a measured comparison of cost, latency, and answer quality across the Claude 4.x lineup in 2026.

11 min readClaudeCost optimizationRead

LLM ContextMay 26, 2026

How Google AI Overviews Pick Citations in 2026

What gets surfaced in AI Overviews — the signals Google uses to choose citations, why semantic HTML beats keywords, and what to fix this week.

12 min readSEOLLM contextRead

LLM ContextMay 26, 2026

How to Write an llms.txt File for AI Search in 2026

A practical guide to authoring llms.txt — the emerging standard that tells ChatGPT, Claude, Perplexity, and Google AI Overviews what your site is about.

13 min readLLM contextSEORead

13 min readManifest V3Service workerRead

Manifest V3 Service Workers for Bulk URL Processing

Engineering patterns for a Chrome extension that survives service-worker restarts mid-job — queue persistence, tab pools, alarms, and what holds up at scale.

LLM ContextMay 26, 2026

How AI Agents Read Markdown Context in 2026

How Claude, ChatGPT, Cursor, and Perplexity actually parse Markdown — what they cite, what they drop, and how to structure pages for higher answer quality.

13 min readLLM contextMarkdownRead

Cost & PerformanceMay 26, 2026

OpenAI vs Voyage vs Cohere Embeddings: 2026 RAG Benchmark

Three embedding-model families compared on a Markdown-corpus RAG task — retrieval quality, cost per million tokens, dimensions, and which fits which workload.

11 min readRAGTokensRead

Anthropic Prompt Caching + Markdown: 90% Cost Reduction

How pairing Anthropic prompt caching with clean Markdown context drops repeat-query costs to ~10% of baseline — with reproducible numbers from a real workflow.

12 min readCost optimizationTokensRead

12 min readReadabilityTurndownRead

Readability vs Trafilatura vs jsdom: 2026 Benchmark

A measured comparison of three HTML content extractors across 50 real pages — extraction fidelity, runtime, edge cases, and which fits a browser extension.

11 min readWeb scrapingChrome extensionRead

Server Scrapers vs Browser Extensions: 2026 Tradeoffs

When server-side scraping APIs win, when a browser extension wins, and the four metrics — latency, auth coverage, cost, rate-limit risk — that decide it.

11 min readManifest V3Content scriptRead

Handling SPA Pages in a Manifest V3 Content Script

Why naive document_idle injection fails on Next.js, React, and Vue apps — and the MutationObserver-and-quiescence pattern that reliably waits for hydration.

Cost & PerformanceMay 26, 2026

Token Math by Content Type: Code, Tables, Lists in 2026

How prose, code, tables, lists, and JSON tokenize differently in 2026 — the per-byte token cost of each content type, and where Markdown compresses best.

11 min readTokensCost optimizationRead

12 min readTurndownPandocRead

Turndown vs Pandoc vs marked: Serializer Benchmark

Three HTML-to-Markdown serializers compared on the same 50 pages — output fidelity, GFM coverage, runtime, and which one fits a browser-side pipeline.