What is data science?

“Data science is an interdisciplinary field that uses scientific methods, processes, algorithms and systems to extract knowledge and insights from data in various forms, both structured and unstructured.”– Wikipedia

The goal of data science is to make sense of data in a way that is meaningful and can drive better decisions.

Imagine you were going to join a new school, with new kids, and you would be leaving your friends from your other school. Finding new friends from all those new kids would be quite a tiring task. How would you know which kids, from the new school, were more likely to be your friends?

A clever person like you, would write down all the characteristics you know about your current friends. For eg.

My Friends

You can see that all your friends like cartoons, even if a little. All of them, except one, have blue as their favourite color, and most of them love Math. So now we could use that information to find out which of the kids in your new school are most likely to be your friends.

https://s3-us-west-2.amazonaws.com/secure.notion-static.com/a694c123-ffa9-466e-8eae-1dd5a2f9b4ee/venn_diagram_(1).jpg

If their favourite color is blue, and they like cartoons, they are extremely likely to be your friend. Still, any kid who loves cartoons is very likely to be your friend, even if their favourite color is red. Their favourite subject doesn’t matter much, but someone whose favourite color is blue, has Math as their favourite subject, and is a cartoon maniac is probably going to make a very good friend.

Now, that’s Data Science right there. It’s getting data, arranging it in a way it can be easily understood, and making decisions out of it.

You did several things to find new friends for yourself -

https://www.youtube.com/watch?v=xC-c7E5PK0Y

Real examples how Data Science is being used

Identifying Breast Cancer