What a Good Import Draft Looks Like
A useful crawler should save editors time without pretending that the imported copy is publication-ready.
crawlermarkdownediting
The best import is not the most aggressive scrape. It is the cleanest working draft.
That means preserving the source URL, deriving a reasonable title and excerpt, stripping junk markup, and writing the result into a format the editorial team already uses. In this project that format is Markdown with frontmatter.
The crawler can save hours, but the last mile still belongs to an editor who decides what deserves to go live.