Lean path to become a successful data scientist

Data Scientist

Why become a data scientist?

A career in data science offers a unique combination of high demand, high salaries, opportunities for growth, and the chance to work on meaningful problems. If you’re interested in mathematics, computer science, and solving real-world problems, becoming a data scientist is the way to go.

Required Skills for becoming a data scientist

Math

Mathematics is the language of the data. Data is represented through numbers, tables, matrices, series etc. Mathematical operations and equations are key to get information out of the data. But we don’t need to learn scary stuff to get started, high school math will suffice. Here is a comprehensive list of topics to get you started:

Probability and Statistics

Linear Algebra

Calculus

Transform Theory (optional)

Information theory (optional)

Programming

Programming is key skill required to implement mathematical concepts learnt on the underlying data. Here is a list of topics and courses to get you started.

Choose a programming Language

I prefer Python, as it is widely adopted, versatile, has amazing community and is easy to learn.

OOPs Concepts

Data Structure and Algorithms

Data Mining and Analysis

So far we have looked at prerequisites for data science. Now comes the core application part. First steps into the journey of data science starts with data mining and analysis.

Data Processing

Exploratory data analysis and visualizations

Association rule mining and clustering

If you are with me so far you will be competent enough to become a data analyst, which is entry level position in the field of data science.

Machine Learning Algorithms

above course covers all machine learning algorithms you need to know as a beginner in grave detail. but for sake of saving some time I’ll list down resources for individual algorithms as well.

linear regression

Logistic regression

Decision tree

Support vector machines

Naive bayes

Random forest

Xgboost

Dimentionality reduction

Clustering

Recommendation system

Model Deployment

Data storage

Data processing

Cloud for machine learning

Data versioning

Experiment tracking

Model serving

Conclusions

I know its a long way ahead if you are just starting your journey, but you don’t have to wait till you complete everything I listed here. There are several checkpoints where you can start your professional journey. For instance you can start your career as a data analyst once you know the core math and basic programming, and slowly keep on building your skills to move further ahead in your career.

One last tip though

You don’t rise to the level of your goals, you fall down to the level of your system.

James Clear, Atomic Habits

So make a habit of learning!

Leave a Reply

Your email address will not be published. Required fields are marked *