A few months ago, I found myself stuck in a loop that probably feels familiar to a lot of people early in their careers.

I would open job descriptions for data scientist roles, scroll through the requirements, and walk away with the same feeling every time:

“I need to learn… everything. And that’s impossible.”

Instead of acting on it, I’d end up scrolling social media, comparing myself to others, and feeling even more stuck. It was a bad cycle, and I knew I had to break it.

That’s when I decided to treat it like a math problem—because that’s how I was trained. When something feels uncertain, my instinct is to reduce ambiguity using what is known. We define the space, identify constraints, and try to approximate what we don’t yet understand. That mindset worked well in areas like optimization and functional analysis, so I wondered: could I apply the same thinking here?

The job market felt chaotic and uncertain—hundreds of postings, evolving tools, and vague, inconsistent requirements. At first, I relied on intuition: “this skill seems popular,” or “maybe I should learn that.” But it felt reactive and unstable. Each job description made sense on its own, but together they created noise. In the AI era, that noise only gets louder. Information is everywhere. Advice is everywhere. But clarity isn’t.

So I reframed the problem. Instead of asking what to learn next, I asked:

“Which skills would increase my alignment with the largest number of roles I care about, given limited time and energy?”

That shift changed everything. I stopped treating job postings as isolated signals and started treating the market as my dataset. Instead of scraping everything, I defined a scope: entry-level data scientist roles at companies I’m genuinely interested in. I wasn’t modeling the entire market—I was modeling my feasible region.

Within that scope, I built a system on AWS to extract skills from job descriptions, track demand patterns, and compare them directly with my resume. The goal was simple: turning messy information into signals I could actually act on.

image.png

Once the pipeline was running, a pattern started to emerge. A small number of skills showed up consistently across many roles, while others had much lower impact. That was the first sign that the problem was more structured than it felt. The market signal was not telling me to learn everything. It was pointing to a narrower set of high-leverage skills that mattered repeatedly across the roles I cared about.

By separating skills into technical and soft categories, I could answer a broader question: was the market only asking for technical depth, or something more? The data suggested it was much more than technical depth alone. Roughly 30% of the top skills were soft skills, and communication appeared in about 90% of the job descriptions. Employers were also consistently looking for collaboration and business-facing problem solving. Those qualities are much harder to signal through a resume alone, which also helped me understand why interviews matter so much: they are often where those skills are evaluated more directly.

image.png

One of the most surprising findings was that my resume already covered roughly 85% of the tracked skill demand across the roles I selected. That did not mean I was automatically a strong fit for every role, but it did challenge my initial assumption that I was missing everything. The problem seemed to be less about adding endless new skills, and more about two things: first, identifying the small number of missing skills that truly mattered; second, thinking about how my existing skills were being interpreted in a crowded market.

image.png

Once that overlap became clear, the project shifted from diagnosing gaps to deciding what was actually worth doing next.

The pipeline now serves two purposes. First, for the most important skills that are not yet covered in my resume, it generates a short explanation of why each one matters and suggests one concrete first step, such as watching an introductory tutorial or reading a practical blog post. The goal is not to chase every keyword, but to make exploration more intentional and easier to start.

Second, I use the pipeline to monitor how the market signal changes over time. The animation below tracks the top 10 skill demands by week across the roles I care about. Some skills remain consistently important, while others rise or fade. That matters because it helps me distinguish one-off mentions from more persistent signals.