How does duplicate detection work if the files are different sizes?

Perceptual hashing compares the visual content of images, not the file bytes. Two images that look the same but differ in resolution, compression, or format will still be detected as duplicates.

Can it find near-duplicates, not just exact copies?

Yes. The similarity threshold controls how closely images must match. Lower thresholds catch more variations (different crops, slight edits). Higher thresholds are stricter and match only very similar images.

How many images can I scan at once?

There is no hard limit. The tool compares all uploaded images against each other. Performance depends on the number of images and your hardware. Hundreds of images are handled quickly; thousands may take longer.

Utilities

Find Duplicates

Scan a set of images to find exact duplicates and near-duplicates using perceptual hashing. Detects similar images even after resizing, recompression, or minor edits. Helps clean up photo libraries and identify redundant assets.

Deploy with Docker View Source

Features

Perceptual hashing for near-duplicate detection
Detects duplicates even after resize, crop, or recompression
Adjustable similarity threshold
Groups duplicates for easy review
Shows file size, dimensions, and format for each match

What you can do

Clean up photo libraries by finding and removing duplicate files
Detect near-duplicates in large image datasets
Identify redundant product images in an asset library
Find similar images across different folders and sources

Self-hosted. Your images never leave your network.

SnapOtter runs entirely on your own infrastructure. Images processed with Find Duplicates are never uploaded to third-party servers. Deploy a single Docker container and process images with full privacy, no watermarks, and no usage limits. Open source under AGPL-3.0.

Frequently asked questions

How does duplicate detection work if the files are different sizes?: Perceptual hashing compares the visual content of images, not the file bytes. Two images that look the same but differ in resolution, compression, or format will still be detected as duplicates.
Can it find near-duplicates, not just exact copies?: Yes. The similarity threshold controls how closely images must match. Lower thresholds catch more variations (different crops, slight edits). Higher thresholds are stricter and match only very similar images.
How many images can I scan at once?: There is no hard limit. The tool compares all uploaded images against each other. Performance depends on the number of images and your hardware. Hundreds of images are handled quickly; thousands may take longer.

More Utilities tools

Image Info

View all metadata and image properties

Learn more Image Compare

Side-by-side comparison of two images

Learn more Color Palette

Extract dominant colors from image

Learn more QR Code Generator

Generate styled QR codes with custom colors, patterns, and logos

Learn more

Ready to try Find Duplicates?

Deploy SnapOtter in under a minute. All 50+ tools included. Open source and free forever.

Get Started View Pricing