Data Science Shorts
20 Key Things Every Data Scientist Needs To Understand to Be in the Top 1%
“The best way to learn data science is to do data science”.
- Your first model will not be the best one; reiteration is the key.
- Documenting your code is equally important as finding ideal parameters for models.
- It's always a good idea to divide your analysis into different files.
- Simplicity often outperforms complexity; don’t assume complex models are the solutions to every problem.
- Knowledge of databases is as important as knowledge of different ML algorithms.
- A good understanding of the problem domain will enhance the quality of your model.
- Your insights are valuable only if others can understand them;
- Always divide your time to be spent on a project into different parts where data analysis should own the biggest pie.
- Never make assumptions based on base param score
- It's a good idea to Group Bar and Pie Charts Together.
mean()is not the answer always; Try using other methods for replacing missing values.- Always place labels on your graphs; a million-dollar painting without a signature is of no worth.
- Make sure to try different evaluation metrics.
- Automate certain tasks that don’t require creativity.
- 1% of clean data is always better than 99% of garbage; data preprocessing is the key;
- Algorithms are tools, not magic wands; understand them deeply.
- “Data science is not a playground for developers to mimic”; it is a scientific expedition, where each analysis is a unique exploration.
- Never Output a Particular Number For Regression; Make use of probability and range;
- Try to use
sample()instead ofhead()andtail()to get a better understanding of data. - If you can’t explain your analysis it's pure garbage.
Gained a thing or two?
Let me know in the comments below 👇
Thanks For Reading Till Here, If You Like My Content and Want To Support Me The Best Way is —
- Follow Me On Medium.
- Connect With Me On LinkedIn.
- Attach yourself to My Email List to never miss reading another article of mine
