PDF/image β page-by-page review. DOC/XLSX/TXT/MD/HTML β chunks-only quick review.
Crawler dΓΉng trafilatura cho main content (strip nav/ads). JS-heavy SPA: tick force JS rendering.