Building Sifty AI: From Personal Frustration to Shipped Product
Identifying a gap in photo management, designing an AI-powered solution, and shipping it to the Play Store — from research through launch.
- Personal pain point (8,000+ photos, zero motivation to sort) turned into a shipped product
- End-to-end ownership: research, strategy, design, development, and launch
- Google Gemini LLM for multimodal photo analysis with composite relevance scoring
- On-device AI descriptions power a keyword search feature no competitor offers
- Live on the Google Play Store
8,000+ photos, a decade of accumulation, and zero motivation to sort
I had over 8,000 images on my phone accumulated over nearly a decade. Screenshots of things I'd already dealt with. Food photos I'd never look at again. Dozens of nearly identical shots from the same moment. Memes. Accidental photos. Images that made sense at the time but were digital clutter months later.
The problem wasn't that I lacked tools to delete photos. The problem was that deciding what to keep vs. what to delete is exhausting. Every photo requires a micro-decision. Multiply that by 8,000 and you understand why most people never start.
I searched online for a solution. Every app I found — gallery cleaners, duplicate finders, storage managers — optimized the act of deletion. A better delete button. A faster swipe interface. Bulk select. But none of them touched the real bottleneck: the cognitive load of the decision itself.
The bottleneck was never the delete button — it was the 8,000 decisions required to reach it.
What exists and why it falls short
I downloaded and tested over 10 photo management apps before building Sifty. Here's what I found.
Google Photos
Smart storage, compression, “Memories” that resurface old photos
Doesn't help you decide what to keep. Resurfaces memories but doesn't declutter.
Gallery Cleaner Apps
Files by Google, Cleaner for iPhone — cache/junk file removal, simple delete UI
Still requires you to make every individual decision. The cognitive load is unchanged.
Duplicate Finders
Detect and remove exact or near-duplicate photos
Solves one narrow problem. Most clutter isn't duplicates — it's photos that outlived their purpose.
AI Photo Organizers
Categorize and tag photos by content, faces, locations
These apps categorize and tag but stop at organization. They don't reduce the collection, and their search remains basic, limited to predefined categories rather than natural-language descriptions.
Every existing solution shifts the UI around deletion. None of them reduce the cognitive load of the decision itself. That's the whitespace Sifty targets.
Honest research, not fabricated data
I didn't commission a survey or fabricate statistics. Here's what my research actually looked like:
Used my own gallery of 8,000+ photos as the primary test case. You can't hide from your own frustrations when you're the user.
Informal but revealing conversations. Everyone described the same problem. Nobody had ever tried to solve it systematically.
Downloaded and tested 10+ gallery management apps. Documented what each did well and where every one fell short.
Reviewed competitor app listings and user reviews. Users consistently praised the ease and speed of deletion these tools offered. Notably absent was the deeper frustration: nobody mentioned how tedious it is to work through thousands of images, as if people didn't realize this was a problem that could be solved.
Eliminate the cognitive load of photo curation
The goal isn't maximum deletion — it's informed decisions. The AI should carry the weight of the decision, with the user confirming or overriding.
The success metric is images analyzed, not photos deleted. If users find value, they run more photos through the app: a single metric that captures both adoption and engagement.
Decide for them, not just show them
The AI should carry the weight of the decision. Users confirm or override, not start from scratch.
Trust is built, not assumed
The learning-then-cleaning system, transparent reasoning, and safe trash bin all exist to earn user trust gradually.
Personal, not generic
Every user's definition of 'worth keeping' is different. The AI must learn individual preferences, not apply generic rules.
Privacy by architecture
Analysis stored on-device. No cloud uploads. Privacy isn't a feature toggle — it's how the system is built.
Images analyzed is both the adoption metric and the engagement metric. New users analyze their first batch. Satisfied users come back to run more. The number only grows when the product delivers real value — accurate recommendations, useful descriptions, and reclaimed storage that users can see.
A system that earns trust before it acts
Rather than analyzing everything at once, Sifty uses a progressive approach: first learn the user, then clean confidently.

Learning — Calibrating to You
The AI selects a random subset of photos from the gallery and processes them through Gemini. Each photo is analyzed and presented with a description and recommendation. As the user reviews each result — keep or delete — the system calibrates scoring weights specific to that user's preferences. This phase builds a personalized model of what matters to this person.
1. Random subset selected from gallery
2. AI presents recommendations with reasoning
3. User reviews and decides on each photo
4. Scoring weights calibrate to user preferences
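The calibration loop above can be sketched as a simple online weight update. This is an illustrative sketch, not Sifty's actual code: the signal names (`is_screenshot`, `is_meme`, etc.), the baseline values, and the perceptron-style update rule are all assumptions for demonstration.

```python
# Hypothetical baseline weights; negative values push toward "delete".
BASELINE_WEIGHTS = {
    "is_screenshot": -0.6,
    "is_duplicate": -0.8,
    "has_faces": 0.7,
    "is_meme": -0.4,
}

def calibrate(weights, features, user_kept, lr=0.1):
    """Nudge weights toward the user's actual keep/delete decision
    (a simple perceptron-style online update)."""
    score = sum(weights[k] * features.get(k, 0.0) for k in weights)
    predicted_keep = score >= 0.0
    if predicted_keep != user_kept:
        direction = 1.0 if user_kept else -1.0
        for k, v in features.items():
            if k in weights and v:
                weights[k] += lr * direction * v
    return weights
```

Under this toy model, a user who keeps their memes during the learning phase would see the `is_meme` weight drift positive, so the cleaning phase stops recommending memes for deletion.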
Cleaning — Full Gallery Analysis
Using the calibrated weights from learning, the AI runs through the entire gallery. Each photo is analyzed, scored, and given a recommendation. During this process, rich text descriptions are generated for every image — these descriptions power the keyword search feature.
1. Custom weights applied across entire gallery
2. Rich text descriptions generated for every photo
3. Personalized keep/delete recommendations at scale
4. Descriptions stored locally for keyword search
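The cleaning pass above can be sketched as a loop over the gallery: analyze, score with the calibrated weights, and record the description for later search. This is a hedged sketch with a stubbed analyzer; in Sifty the analyzer would be a Gemini API call, and `clean_gallery`, `stub_analyzer`, and the feature names are hypothetical.

```python
def stub_analyzer(photo_path):
    """Stand-in for the multimodal model; returns a description plus signals."""
    return {"description": f"photo at {photo_path}",
            "features": {"is_screenshot": 1.0}}

def clean_gallery(photo_paths, weights, analyzer, keep_threshold=0.0):
    """Score every photo with the calibrated weights and record its description."""
    index = {}
    for path in photo_paths:
        result = analyzer(path)
        score = sum(weights.get(k, 0.0) * v
                    for k, v in result["features"].items())
        index[path] = {
            "description": result["description"],
            "score": score,
            "recommendation": "keep" if score >= keep_threshold else "delete",
        }
    return index
```

The same index that drives recommendations doubles as the corpus for keyword search, which is how the search feature fell out of the scoring infrastructure.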

Each photo receives a composite relevance score — not a binary keep/delete flag, but a weighted continuum personalized to the user. The scoring model starts with baseline weights and recalibrates during learning based on the user's actual decisions.
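One minimal way to express "a weighted continuum, not a binary flag" is a weighted sum of signals squashed into [0, 1], with a review band in the middle. This is an illustrative sketch, not Sifty's actual model; the signal names, thresholds, and logistic squash are assumptions.

```python
import math

def relevance_score(features, weights):
    """Weighted sum of signals squashed to a [0, 1] continuum
    (higher means more worth keeping)."""
    raw = sum(weights.get(k, 0.0) * v for k, v in features.items())
    return 1.0 / (1.0 + math.exp(-raw))

def recommend(score, keep_at=0.6, delete_at=0.4):
    """Map the continuum to a recommendation, leaving a 'review' band
    instead of forcing false binary certainty."""
    if score >= keep_at:
        return "keep"
    if score <= delete_at:
        return "delete"
    return "review"
```

The middle band is what makes room for cases like the flight-confirmation screenshot discussed below: relevant today, irrelevant in three months.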
The decisions that shaped the product
Every product is a series of trade-offs. Here are the seven decisions that had the most impact on what Sifty became.
Why two phases instead of one?
A single-pass analyzer would be simpler to build and faster for users. Why add the complexity of a learning phase?
Learn first, then clean
A single pass applies generic rules to everyone. But a food photo is trash for one person and a cherished memory for another. The learning phase calibrates weights to individual preferences before the AI touches the full gallery. This is fundamentally different from existing tools that apply one-size-fits-all rules — Sifty earns the right to decide by learning what you care about first.
Progressive disclosure applied to an AI system. User effort invested early compounds into trust and accuracy later.
Choosing the right LLM
GPT-4V (strong vision, high cost), Claude (excellent reasoning), Gemini (strong multimodal, generous free tier)
Google Gemini
For a consumer app processing thousands of photos per user, API cost is existential. Gemini offered the best balance of multimodal quality and cost per image. The free tier made prototyping viable. At 2-3x the cost per image, other models would have made the free tier unsustainable.
Unit economics as much as a technical decision. The relationship between AI capability and business model viability is a PM responsibility.
Why on-device, not cloud?
Cloud storage is easier to sync and scale. On-device means no cross-device access. Why accept that trade-off?
On-device SQLite
Photos are deeply personal. Storing analysis on-device eliminated privacy concerns entirely — no data leaves the phone. It also meant zero server costs and enabled offline keyword search. For a gallery app that's inherently device-specific, the sync trade-off was acceptable.
Privacy as architecture, not a checkbox. The system is designed so that compromising user data isn't possible, not just unlikely.
Scoring on a spectrum, not a binary
Binary keep/delete would be simpler to present and faster to act on.
Composite relevance score
Binary classification forces false certainty. A flight confirmation screenshot isn't 'keep forever' or 'delete now' — it's relevant today, irrelevant in three months. The composite score acknowledges that importance exists on a spectrum, and the calibrated weights ensure the spectrum is personal.
Resisting the temptation to oversimplify. The scoring system creates room for future features (time-decay, archiving) without re-architecting.
One photo at a time
Showing a grid of 20 photos is more 'efficient' — more photos visible, batch operations possible.
Swipe interface
Testing showed that seeing many photos at once increased decision fatigue — the opposite of what Sifty exists to solve. The swipe interface forces single-photo focus, matching how the AI presents its recommendation. One photo, one decision, one swipe. Cognitive load per decision drops to nearly zero.
The interaction model must align with the core value proposition, even when it looks less efficient on paper.
Deliberate friction before deletion
Direct delete gives immediate space recovery and a simpler flow.
Safe trash bin
Deleting photos is irreversible and emotionally charged. The trash bin adds one step of friction but eliminates the fear of making a mistake. Essential during cleaning where the AI acts autonomously — users need to know they can review and reverse before anything is permanent.
Sometimes making something slightly harder makes the overall experience dramatically better. Trust is the product's most important currency.
Free at launch, monetize later
Freemium with limits, subscription, ad-supported, or completely free.
Completely free at launch
Launching a consumer app in a crowded category with a paywall is a distribution problem. The priority was real usage data and word-of-mouth. Monetization is planned but gating the core experience before proving product-market fit would be premature.
Sequencing decisions correctly. Monetization is a strategy question, not a launch requirement.
AI-powered keyword search: find any photo by describing it
During analysis, Gemini generates a rich text description for every photo. These descriptions are stored locally on the device. This infrastructure byproduct became a standalone feature: semantic search across your entire gallery.

The descriptions needed for relevance scoring turned out to be a standalone product feature. Good PMs recognize when infrastructure creates unexpected product value.
Google Photos search requires cloud processing and only works with cloud-stored photos. Sifty's search works entirely on-device, across your full native gallery, with descriptions enriched by context the AI learned about you. No internet required. No data leaves your phone.
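The on-device search described above can be sketched with SQLite's built-in full-text index. This is a hedged illustration in Python for readability (the app itself is an Android app); the table name, schema, and sample descriptions are assumptions, not Sifty's actual storage layout.

```python
import sqlite3

# In the real app this would be a local database file on the device.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE VIRTUAL TABLE photo_index USING fts5(photo_path, description)")
conn.executemany(
    "INSERT INTO photo_index VALUES (?, ?)",
    [
        ("/gallery/IMG_2041.jpg", "golden retriever running on a beach at sunset"),
        ("/gallery/IMG_0007.jpg", "screenshot of a flight confirmation email"),
    ],
)

def search(query):
    """Full-text match against locally stored descriptions; no network needed."""
    rows = conn.execute(
        "SELECT photo_path FROM photo_index WHERE photo_index MATCH ?", (query,)
    )
    return [row[0] for row in rows]
```

Because the AI-generated descriptions are richer than filenames or EXIF tags, even a plain full-text match like this behaves like semantic search: querying "beach" finds a photo that was never labeled by the user.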
High-level technical architecture
A privacy-first architecture where all user data stays on the device.
From code to Play Store
The go-to-market was intentionally lean — prove the product works before investing in paid acquisition.
Product development
From initial prototype to production-ready app with learning and cleaning system, composite scoring, and keyword search.
siftyai.com
Launched the product site to support the app — clear storytelling, honest positioning, and direct download links.
Play Store submission and ASO
Published on Google Play Store. Optimized listing with screenshots, feature descriptions, and targeted keywords.
Organic growth
Word-of-mouth, portfolio showcase, and organic discovery. No paid acquisition at this stage — the priority is proving product-market fit.
Real metrics, not vanity numbers
I'm committed to sharing honest metrics. These are early-stage numbers that will be updated as usage grows.
In my own gallery of 8,000+ photos, Sifty helped me identify and remove thousands of images I'd been carrying for years — screenshots, accidental photos, memes, and duplicates. Beyond decluttering, the keyword search feature became something I use weekly to find specific photos without scrolling.
What I learned and what's next
The most difficult decisions were product decisions: what to build, what to cut, how to sequence, and when to ship. Getting the technology to work was straightforward compared to getting the product right.
You can't hide from your own frustrations when you're the user. Every annoyance was a feature request. Every delight was validation.
It would have been easier to ship a single-pass analyzer. The learning-first approach took longer to build but the quality difference is what makes Sifty work.
Keyword search emerged from the analysis infrastructure. The descriptions needed for scoring became a product feature nobody planned for.
Try Sifty AI yourself
See the product behind this case study. Download Sifty AI free and let the AI learn what matters to you.