Navigation: <-- Why a Project? | Part Index | Main Index | Working as a Team on a DS Project -->
Finding Your Idea
Requires: Why a Project?
Motivation: You know why a project is worth doing. The harder question can sometimes be: What should it be? Worry not; you probably have more potential ideas than you realize.
This nugget gives you a set of prompts to surface candidate problems from your own background, a template to sharpen each one into a concrete, workable question, and a short list of examples to calibrate what a completed project looks like.
Table of Contents
- Where to Look for Problems
- From Problem to a Workable Question
- Committing to One Idea as a Group
- Summary
Where to Look for Problems
Good ideas rarely arrive fully formed. They start as a vague sense that something could be better understood, made faster, or done more reliably.
The place to start is not "What could a model do?" but "Where does something feel difficult or uncertain?"
Here are three entry points. For each one, ask yourself the accompanying questions and write down anything that seems plausible. Do not filter yet.
Write down 2-3 problem candidates before moving on. Do not evaluate them yet.
Solve a problem for yourself
- What do you do repeatedly in your work that is slow, error-prone, or relies on judgment you cannot easily articulate?
- Is there a quantity you estimate by hand? A decision you make from experience, but would struggle to write down as rules?
Who benefits? You do โ and people in similar roles.
Help someone else or the community
Someone else's workflow problem often makes an excellent project because the question is already real: someone cares about the answer.
- Ask a colleague, friend, or family member what they spend time on that feels repetitive or uncertain.
Who benefits? Name them specifically. If you cannot, the problem may not be concrete enough yet.
Follow a curiosity
Look at data you already have โ calibration records, measurement logs, sensor archives, a spreadsheet you have kept for years โ and ask: What structure might be here that is worth examining systematically?
Who benefits? Anyone who uses or depends on that data.
A few example ideas
Here are some exemplary, still vague problem statements as may be typical when the idea first surfaces. Though the following examples are mostly engineering-specific, you are WELCOME to figure out anything that you are interested in yourself.
- "The spectra look different depending on material, but we have never examined why."
- "Our sensor seems to degrade, but we have no systematic way to tell when it is about to fail."
- "Yield varies batch to batch and nobody is sure what drives it."
From Problem to a Workable Question
Once you have candidate problems, the next step is to check how they can be framed as a data science question. The framing matters because it determines what "done" looks like and which tools apply.
Step 1: Recognize What Kind of Question It Is
Scan the first column below (ignore the last column, it's just for look-up later). Which business question sounds like yours?
| Business question | Verb | Core methods |
|---|---|---|
| How much or how many? | estimate | Regression |
| Which category does this belong to? | classify | Classification |
| What will happen over time? | forecast | Time series |
| Who or what is similar? | group | Clustering |
| What is unusual or wrong? | detect | Anomaly detection |
| What should I show or suggest? | recommend | Recommender systems |
| What is most relevant or risky? | rank | Learning to rank, scoring |
| What drives this outcome? | explain | Causal inference, interpretability |
| What is the underlying structure? | reduce | Dimensionality reduction, embeddings |
If a row resonates, note the verb and use it in the next step. Try to find the best fit. Everything can be refined later.
Step 2: Frame It as a Sentence
Complete the sentence: "I want to [VERB] [WHAT] from [DATA]."
- WHAT is what you want to know, produce, or uncover (derived from your problem statement).
- DATA is the data you have or could collect.
Here are the three example ideas from the previous section with the template applied:
| Vague problem | Translated question | Type |
|---|---|---|
| "The spectra look different depending on material, but we have never examined why." | "I want to group unlabelled spectra into material categories from their measured features." | Group |
| "Our sensor seems to degrade, but we have no systematic way to tell when it is about to fail." | "I want to detect imminent failures in a sensor component from its usage patterns." | Detect |
| "Yield varies batch to batch and nobody is sure what drives it." | "I want to estimate batch yield from process temperature and pressure logs." | Estimate |
If you can fill in WHAT and DATA with concrete things (like the bold pieces in the table), you have a workable question. If you cannot, the problem needs more specificity, not more ambition.
The framing template provided here is a translation tool. It may not be perfect for all ideas you have. In doubt: adjust wording slightly. Carry two or three sharpened questions into the next section.
Committing to One Idea as a Group
Each team member now has two or three sharpened candidate questions. The next task is to select one.
This is a coordination problem, not a technical one. The goal is to find a question the whole group can invest in, where accessible data already exists, and where the framing is concrete enough to get started.
A workable selection process:
- Each person presents their candidates using the sentence template. One sentence per idea, plus who benefits.
- After everyone has presented, mark the ideas that already have a known data source behind them. Ideas without a clear data source are harder to move forward at this stage.
- Discuss genuine interest and feasibility. You are looking for overlap: a question that at least two people find interesting and where the group believes data is accessible.
- When there is no clear winner, choose the idea with the most concrete data source, not the most ambitious question.
Once your group commits to a question, it is the group's question, regardless of who proposed it.
You will not know for certain whether the idea is feasible at this point. That is fine: ๐ Reality-Checking Your Idea is exactly for that. Commit now, verify next.
Summary
- Start with the problem, not the solution: ask where something is difficult or uncertain before asking what a model could do.
- Three reliable sources: your own workflow, problems you observe in others, and data you already have.
- Write several candidates before filtering; name who benefits for each.
- Scan the problem-type table by business question to recognize which kind of question yours is, then frame it as "I want to [VERB] [WHAT] from [DATA]."
- For group selection: mark ideas with known data sources, look for shared interest, and when in doubt pick the most concrete option over the most ambitious one.
As always: Happy learning, happy life! ๐ซถ
Navigation: <-- Why a Project? | Part Index | Main Index | Working as a Team on a DS Project -->
Script v1.4 (2026-06-10) ยท FGN