Image and Video Handling

Industry-specific file types and leveraging previous work and workflows from yourself and others to avoid creating new one-offs Today’s Small, Good Thing involved a CD with MRI images from a radiologist’s office. It’s labeled “For Physicians Only,” and while I’m not a doctor, I’ve played one on IRC. The files didn’t open with the default…

elasticsearch and fscrawler

Great-to-Haves — Handled elasticsearch with fscrawler handled these well for my needs. Low-touch / unattended ingestion Sensible, configurable defaults Helpful, accurate dry runs Non-catastrophic re-runs (i.e. smart enough to minimize overwriting or duplicating existing entries) Clever de-duping Customizable / scriptable input and output handling File meta data capture Full-Text indexing of file content “Clever de-duping”…