khuyentran1401/Data-science
Collection of useful data science topics along with code and articles
repo name | khuyentran1401/Data-science |
repo link | https://github.com/khuyentran1401/Data-science |
homepage | https://khuyentran1476.medium.com/ |
language | Jupyter Notebook |
size (curr.) | 91123 kB |
stars (curr.) | 396 |
created | 2020-07-17 |
license | |
Data Science Topics
Collection of useful data science topics along with code and articles in my data science blog. If you are using an RSS feeds reader, add this link to your reader to get the notifications on my latest articles.
If you prefer shorter versions of these blogs, subscribe to Data Science Simplified to receive bite-sized data science tips in your mailbox daily.
Contents
- Data Science Tools
- Machine Learning
- Natural Language Processing
- Computer Vision
- Time Series
- Visualization
- Mathematical Programming
- Scraping
- Python
- Terminal
- Linear Algebra
- Data Structure
- Statistics
- Applications
- Learning Tips
- Productive Tips
- Book Review
- Data Science Portfolio
Data Science Tools
Title | Medium article | Repository |
---|---|---|
How to Create Fake Data with Faker | link | link |
How to Embed Interactive Charts on your Medium Articles and Personal Website | link | link |
How to Monitor And Log your Machine Learning Experiment Remotely with HyperDash | link | link |
How to Leverage Visual Studio Code for your Data Science Projects | link | |
Introduction to Datapane: A Python Library to Build Interactive Reports | link | |
Datapane’s New Features: Create a Beautiful Dashboard in Python in a Few Lines of Code | link | link |
How to Share your Python Objects Across Different Environments in One Line of Code | link | link |
Top 4 Code Viewers for Data Scientist in VSCode | link | |
How to Create and View Interactive Cheatsheets on the Command-line | link | |
Introduction to Hydra.cc: A Powerful Framework to Configure your Data Science Projects | link | |
How to Share your Jupyter Notebook in 3 Lines of Code with Ngrok | link | |
Introduction to Schema: A Python Libary to Validate your Data | link | link |
Pytest for Data Scientists | link | link |
Introduction to DVC: Data Version Control Tool for Machine Learning Projects | link | link |
Incorporate the Best Practices for Python with These Top 4 VSCode Extensions | link | |
4 Lessor-Known Yet Awesome Tips for Pytest | link | link |
How to Effortlessly Publish your Python Package to PyPI Using Poetry | link | link |
Machine Learning
Title | Medium article | Repository |
---|---|---|
How to Efficiently Fine-Tune your Machine Learning Models | link | link |
How to Learn Non-linear Dataset with Support Vector Machines | link | link |
Introduction to IBM Federated Learning: A Collaborative Approach to Train ML Models on Private Data | link | link |
3 Steps to Improve your Efficiency when Hypertuning ML Models | link | |
Train your Machine Learning Model 150x Faster with cuML | link | link |
Introduction to Weight & Biases: Track and Visualize your Machine Learning Experiments in 3 Lines of Code | link | link |
Natural Language Processing
Title | Medium article | Repository |
---|---|---|
Sentiment Analysis of LinkedIn Messages | link | link |
Find Common Words in Article with Python Module Newspaper and NLTK | link | link |
How to Tokenize Tweets with Python | link | link |
How to Solve Analogies with Word2Vec | link | link |
What is PyTorch | link | link |
Convolutional Neural Network in Natural Language Processing | link | link |
Supercharge your Python String with TextBlob | link | link |
pyLDAvis: Topic Modelling Exploration Tool That Every NLP Data Scientist Should Know | link | link |
Streamlit and spaCy: Create an App to Predict Sentiment and Word Similarities with Minimal Domain Knowledge | link | link |
Computer Vision
Title | Medium article | Repository |
---|---|---|
How to Create an App to Classify Dogs Using fastai and Streamlit | link | link |
Time Series
Title | Medium article | Repository |
---|---|---|
3 Ways to Extract Features from Dates with Python | link | link |
Visualization
Title | Medium article | Repository |
---|---|---|
What I Learned from Scraping 15k Data Science Articles on Medium | link | link |
How to Create Interactive Plots with Altair | link | link |
How to Create a Drop-Down Menu and a Slide Bar for your Favorite Visualization Tool | link | link |
I Scraped more than 1k Top Machine Learning Github Profiles and this is what I Found | link | link |
Top 6 Python Libraries for Visualization: Which one to Use? | link | link |
Introduction to Yellowbrick: A Python Library to Visualize the Prediction of your Machine Learning Model | link | link |
Visualize Gender-Specific Tweets with Scattertext | link | link |
Visualize Your Team’s Projects Using Python Gantt Chart | link | link |
Mathematical Programming
Title | Medium article | Repository |
---|---|---|
How to choose stocks to invest in with Python | link | link |
Maximize your Productivity with Python | link | link |
How to Find a Good Match with Python | link | link |
Scraping
Title | Medium article | Repository |
---|---|---|
Web Scrape Movie Database with Beautiful Soup | link | link |
top-github-scraper: Scrape Top Github Users and Repositories Based On a Keyword in One Line of Code | link | link |
Python
Title | Medium article | Repository |
---|---|---|
Numpy Tricks for your Data Science Projects | link | link |
Cython-A Speed-Up Tool for your Python Function | link | link |
Timing for Efficient Python Code | link | link |
How to Use Lambda for Efficient Python Code | link | link |
Python Tricks for Keeping Track of Your Data | link | link |
Boost Your Efficiency With Specialized Dictionary Implementations in Python | link | link |
Dictionary as an Alternative to If-Else | link | link |
How to Use Zip to Manipulate a List of Tuples | link | link |
Get the Most out of Your Array With These Four Numpy Methods | link | link |
3 Python Tricks to Read, Create, and Run Multiple Files Automatically | link | link |
How to Exclude the Outliers in Pandas DataFrame | link | link |
Python Clean Code: 6 Best Practices to Make Your Python Functions More Readable | link | link |
3 Techniques to Effortlessly Import and Execute Python Modules | link | link |
Terminal
Title | Medium article | Repository |
---|---|---|
Understand CSV Files from your Terminal with XSV | link | |
Prettify your Terminal Text With Termcolor and Pyfiglet | link | link |
Stop Using Print to Debug in Python. Use Icecream Instead | link | |
Rich: Generate Rich and Beautiful Text in the Terminal with Python | link | link |
Create a Beautiful Dashboard in your Terminal with Wtfutil | link | link |
Linear Algebra
Title | Medium article | Repository |
---|---|---|
How to Build a Matrix Module from Scratch | link | link |
Linear Algebra for Machine Learning: Solve a System of Linear Equations | link | link |
Data Structure
Title | Medium article | Repository |
---|---|---|
Convex Hull: An Innovative Approach to Gift-Wrap your Data | link | link |
How to Visualize Social Network With Graph Theory | link | link |
How to Search Data with KDTree | link | link |
How to Find the Nearest Hospital with a Voronoi Diagram | link | link |
Statistics
Title | Medium article | Repository |
---|---|---|
Can Datasets of a Dinosaur and a Circle have Identical Statistics? | link | |
Introduction to One-Way ANOVA: A Test to Compare the Means between More than Two Groups | link | link |
Applications
Title | Medium article | Repository |
---|---|---|
How to Create an Interactive Startup Growth Calculator with Python | link | link |
Streamlit and spaCy: Create an App to Predict Sentiment and Word Similarities with Minimal Domain Knowledge | link | link |
Learning Tips
Title | Medium article | Repository |
---|---|---|
How to Learn Data Science when Life does not Give You a Break | link | |
How to Accelerate your Data Science Career by Putting yourself in the Right Environment | link | |
To become a Better Data Scientist, you need to Think like a Programmer | link | |
How not to be Overwhelmed with Data Science | link |
Productive Tips
Title | Medium article | Repository |
---|---|---|
Boost Your Efficiency with Customized Code Snippets on VSCode | link | |
How to Organize your Data Science Articles with Github | link | link |
How to Create Reusable Command-Line | link | |
5 Reasons why you should Switch from Jupyter Notebook to Scripts | link | |
3 Ways to Get Notified with Python | link | link |
Top 9 Keyboard Shortcuts in VSCode for Data Scientists | link | |
7 Reasons Why you Should Start Documenting your Code | link |
Book Review
Title | Medium article | Repository |
---|---|---|
Python Machine Learning: A Comprehensive Handbook for Machine Learning | link |
Data Science Portfolio
Title | Medium article | Repository |
---|---|---|
How to Create an Elegant Website for your Data Science Portfolio in 10 minutes | link | |
Build an Impressive Github Profile in 3 Steps | link |