Utilities
Find Duplicates
Scan a set of images to find exact duplicates and near-duplicates using perceptual hashing. Detects similar images even after resizing, recompression, or minor edits. Helps clean up photo libraries and identify redundant assets.
Features
- Perceptual hashing for near-duplicate detection
- Detects duplicates even after resize, crop, or recompression
- Adjustable similarity threshold
- Groups duplicates for easy review
- Shows file size, dimensions, and format for each match
What you can do
- Clean up photo libraries by finding and removing duplicate files
- Detect near-duplicates in large image datasets
- Identify redundant product images in an asset library
- Find similar images across different folders and sources
Self-hosted. Your images never leave your network.
SnapOtter runs entirely on your own infrastructure. Images processed with Find Duplicates are never uploaded to third-party servers. Deploy a single Docker container and process images with full privacy, no watermarks, and no usage limits. Open source under AGPL-3.0.
Frequently asked questions
- How does duplicate detection work if the files are different sizes?
- Perceptual hashing compares the visual content of images, not the file bytes. Two images that look the same but differ in resolution, compression, or format will still be detected as duplicates.
- Can it find near-duplicates, not just exact copies?
- Yes. The similarity threshold controls how closely images must match. Lower thresholds catch more variations (different crops, slight edits). Higher thresholds are stricter and match only very similar images.
- How many images can I scan at once?
- There is no hard limit. The tool compares all uploaded images against each other. Performance depends on the number of images and your hardware. Hundreds of images are handled quickly; thousands may take longer.
Ready to try Find Duplicates?
Deploy SnapOtter in under a minute. All 50+ tools included. Open source and free forever.