Enterprise Apps Infrastructure Technology Vendors

elasticsearch and fscrawler

Great-to-Haves — Handled elasticsearch with fscrawler handled these well for my needs. Low-touch / unattended ingestion Sensible, configurable defaults Helpful, accurate dry runs Non-catastrophic re-runs (i.e. smart enough to minimize overwriting or duplicating existing entries) Clever de-duping Customizable / scriptable input and output handling File meta data capture Full-Text indexing of file content “Clever de-duping”…