--- id: 4c5b9fcd-feac-4cce-8773-6a9f745858e0 type: fix title: "Clean PDF extraction text for Foundry textareas" tags: [vagabond-rpg, foundryvtt, pdf-extraction, fix] importance: 0.5 confidence: 0.8 created: "2025-12-18T19:34:33.879088+00:00" updated: "2025-12-18T19:34:33.879088+00:00" --- When extracting text from PDFs for Foundry VTT compendiums: 1) Strip HTML tags since textareas show raw text not rendered HTML, 2) Fix mid-sentence line breaks from PDF column formatting by joining lines that don't end in sentence punctuation (.\!?:). This ensures descriptions display cleanly in sheet textareas.