Data Science Portfolio for Brian Beames

Resources

This page showcases the tools and resources I have collected, including GitHub repositories, data science articles, bookmarks, and books.

GitHub Repositories

tensorflow models
tensorflow docs
tensorflow tpu
tensorflow tensorflow
tensorflow community
Pretrained models for TensorFlow.js
Books:
joelgrus data-science-from-scratch
wesm pydata-book
fastai fastbook
Effective Python: Second Edition Source Code and Errata for the Book
Grokking Deep Learning
Deep Reinforcement Learning Hands On Second Edition
The official code repository for examples in the O'Reilly book called Generative Deep Learning
Exercises for the Deep Learning textbook at www.deeplearningbook.org
Jupyter notebooks for the code samples of the book Deep Learning with Python
The Python code to reproduce the illustrations from The Hundred Page Machine Learning Book
Companion repository for the book Building Machine Learning Powered Applications
Code samples for my book Neural Networks and Deep Learning
Full Speed Python: a book for self learners
deep learning cookbook
Hands On Computer Vision with TensorFlow 2 published by Packt
Python Data Science Handbook: full text in Jupyter Notebooks
Source files for Learning Statistics with R
LaurentRosenfeld thinkperl6
The Jupyter Notebooks behind my O'Reilly report A Whirlwind Tour of Python
LaTeX source and supporting code for Think Python 2nd edition by Allen Downey.
Example code for the book Fluent Python
Code exploration from Deep Learning for the Life Sciences: Applying Deep Learning to Genomics Microscopy Drug Discovery and More
The probability and statistics cookbook
Notebooks and code for the book Introduction to Machine Learning with Python
Git repository for Think Stats 2nd Ed
Python code for the free book A Programmers Guide to Data Mining
Hadoop illuminated hadoop book
Courses:
Coursera Course materials for the Data Science Specialization
Data Visualization IBM Practice Coursera
UCSanDiegoX edX Course DSE210x Statistics and Probability in Data Science using Python
Lecture Slides and R Sessions for Trevor Hastie and Rob Tibshirani's Statistical Learning Stanford course
Interview questions:
data science interview questions and answers
Answers to 120 commonly asked data science interview questions
Miscellaneous:
pandasgui
A tool for parsing breached passwords
Probabilistic Programming and Bayesian Methods for Hackers
CenterForOpenScience
automatebot: This bot automates your reporting task by querying data from Google BigQuery, creating a visualization, and automatically sending the image through Telegram Chat
python guide
digiCamControl DSLR camera remote control open source software
PythonEXE How to create an executable file from a Python script
ds cheatsheets I have all of these cheatsheets downloaded on my computer.
cookiecutter data science
misc projects
data science template
Statsmodels: statistical modeling and econometrics in Python
scikit-learn: machine learning in Python
pandas
Jupyter metapackage for installation docs and chat
A gallery of interesting Jupyter Notebooks
Custom Jupyter Notebook Themes
TheAlgorithms Python
Solutions for various coding algorithmic problems and many useful resources for learning algorithms and data structures
learn python3
kaggle api
SIIM ISIC Melanoma Classification 1st Place Solution
Python programming exercises
python reference
Playground and cheatsheet for learning Python. Collection of Python scripts that are split by topics and contain code examples with explanations
A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit Learn Keras and TensorFlow 2
A Python library for easy data analysis visualization exploration and modeling
Your new Mentor for Data Science E Learning
An open source toolkit for large scale genomic analysis
A python library built to empower developers to build applications and systems with self contained Computer Vision capabilities
PEP 8 for Humans
bitcoin
An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library
Modin: Speed up your Pandas workflows by changing a single line of code
Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Data Version Control Git for Data & Models
Data Science Using Python
folium Python Data Leaflet js Maps
Plotly Express Simple syntax for complex charts
Jupyter handsontable integration
A Jupyter Leaflet js bridge
Interactive Widgets for the Jupyter Notebook
fuzzywuzzy Fuzzy String Matching in Python
A simple and efficient tool to parallelize Pandas operations on all available CPUs
A Jupyter Three js bridge
TannerGilbert Tutorials
This is a template for creating a Machine Learning application with its front end developed using React which interacts with a Flask service as the back end and makes predictions
awesome public datasets
This repo contains tutorials on OpenCV Python library using new cv2 interface
High level tools to simplify visualization in Python
This repository contains all of the code related to my articles published in Towards Data Science on Medium from February 2019 onwards
Python implementations of the k modes and k prototypes clustering algorithms for clustering categorical data
Notebook to download machine learning flashcards
Python bindings for cairo
minimal flask example appengine
How to do data science with Optimus Spark and Python
Gallery
Some jupyter notebook
An interactive grid for sorting filtering and editing DataFrames in Jupyter notebooks
A collection of IPython notebooks covering various topics.
Gentle introduction to dash development and deployment via Heroku
Installing R and RStudio and setup R using Powershell
Open source Python module for computer vision
Exploratory Data Analysis EDA is performed on the E Commerce data obtained from a UK based and registered non store online retail to discover interesting transactional patterns of different customers and countries
This repo serves as code reference for the following TDS post A starter pack to exploratory data analysis with python pandas seaborn and scikit learn
pgmagick is a yet another boost python based wrapper for GraphicsMagick ImageMagick
3D convolutional autoencoder for fmri volumes Learning spatial and temporal features of fMRI brain images
A template for research or data analysis projects structured as R packages
Seaborn Visualizations
Build your own (insert technology here)
Freely available programming books
A delightful community-driven framework (with 1700 plus contributors) for managing your zsh configuration. Includes 200 plus optional plugins (rails, git, OSX, hub, capistrano, brew, ant, php, python, etc.), over 140 themes to spice up your morning, and an auto-update tool that makes it easy to keep up with the latest updates from the community.
A complete computer science study plan to become a software engineer.
A collection of useful gitignore templates
Learn how to design large scale systems Prep for the system design interview Includes Anki flashcards
A collective list of free APIs for use in software and web development
Master the command line, in one page
Algorithms and data structures implemented in JavaScript with explanations and links to further readings
Roadmap to becoming a web developer in 2020
Machine learning Deep Learning CNN with PyTorch
Command-line program to download videos from YouTube.com and other video sites

Data Science Articles

These are the articles I currently have downloaded on my computer. If a link takes you to my data science profile, the author has taken that article down from the web. Click the Resources tab on the left to get back to this page.

Artificial Intelligence:
Explainable AI: From Prediction To Understanding
Explainable AI: Interpreting the neuron soup of deep learning
Intro to FastAI: Installation and Building our First Classifier
Top 10 roles in AI and data science
Autoencoder:
Image Compression Using Autoencoders in Keras
what is variational autoencoder vae tutorial
Understanding Vector Quantized Variational Autoencoders VQ VAE
Big Data:
Apache Spark Optimization Toolkit
High Level Overview of Apache Spark
Learn Spark for Big Data Analytics in 15 mins
Spark core concepts explained
Stop using Pandas and start using Spark with Scala
The Big Data Handbook
Training multiple machine learning models and running data tasks in parallel via YARN Spark multithreading
Unraveling the Staged Execution in Apache Spark
Books Suggested:
10 Programming Books you Need to Read
15 Artificial Intelligence Books You Should Read
Careers in Data Science:
Freelance:
How to Become a Freelance Data Scientist Springboard blog
How to Become a Freelance Data Scientist
My Experience as a Freelance Data Scientist
Careers in Data Science Main Folder
4 important reasons so many data scientists are leaving their jobs
8 Useful Advices for Aspiring Data Scientists
9 Reasons why youll never become a Data Scientist
11 Remote Workers on the Strategies They Use to Bond With Co Workers
A Day in the Life of a Data Scientist
Advice for Applying to Data Science Jobs
A guide for applying to data science jobs
Advice For New and Junior Data Scientists
All You Need to Know to Break into the Data World and Machine Learning
But what is this machine learning engineer actually doing
How to become a data scientist: A cheat sheet
Data Scientists: Why are they so expensive to hire
How I landed offers from Microsoft Amazon and Twitter without an Ivy League degree
How to Become a Data Scientist Without a Degree
How To Become a Data Scientist Without CS Degree
How To Get Your Data Scientist Career Started
How To Go Into Data Science
How to land a Data Scientist job at your dream company My journey to Airbnb
How To Market Yourself As a Programmer
How to Spot a Fake Data Scientist
How to Think Like a Data Scientist in 12 Steps
I had no idea how to write code two years ago Now Im an AI engineer
If youre a developer transitioning into data science here are your best resources
My Weaknesses as a Data Scientist
Planning Your Next Career Move With Stack Overflows 2019 Survey
Remote Data Science Internships For Everyone With Certificates
Screening candidates for data science positions my experiences
Six Recommendations for Aspiring Data Scientists
Succeeding as a data scientist in small companies startups
Teach Yourself Data Science: the learning path I used to get an analytics job at Jet dot com
The Data Science Interview Study Guide
The Data Scientist Shortage is Huge Heres How to Beat It
The Kinds of Data Scientist
The Most In Demand Skills for Data Scientists
The online courses you must take to be a better Data Scientist
The Third Wave Data Scientist
The Two Sides of Getting a Job as a Data Scientist
The Ultimate Guide to Learning to Code and Getting Paid
To get hired as a data scientist dont follow the herd
Ugly Truths About Working From Home
What Experience is Worth Finding Good Data Scientists
Why you shouldnt be a data science generalist
You Can Get A Data Analytics Job Without A Masters In Data Science
Clustering:
An introduction to clustering algorithms
Into the world of clustering algorithms: k means k modes and k prototypes
K means Clustering Python Example
Spectral Clustering Algorithm Implemented From Scratch
Three Popular Clustering Methods and When to Use Each
When Clustering Doesnt Make Sense
Dashboarding:
Dashboard Design: 8 Types of Online Dashboards
The best tools for Dashboarding in Python
Data Science in General:
Concerning Data:
Data Types for Data Sciences
Dealing with absence of value
How to Handle Missing Data
The complete beginners guide to data cleaning and preprocessing
Understanding data Musings on information memory analytics and distributions
8 Common Data Structures every Programmer must know
The top data structures you should know for your next coding interview
Data Preprocessing Concepts with Python
Data Science Portfolio:
How To Create a Data Science Portfolio Website
A Checklist for Preparing your Data AI Portfolio
Put together a data science portfolio and get noticed
Data Science Projects:
Building an End To End Data Science Project
How do data science projects work
How to build a data science project from scratch
How to Organize Your Data Science Project
Learn to build an end to end data science project
Multicollinearity Impacts Your Data Science Project More Than You Know
Programming in General:
5 Bad Habits of Absolutely Ineffective Programmers
6 Programming Habits That Surprisingly Not Many Developers Have
10 Extraordinary GitHub Repos for All Developers
30 Things I Wish I Knew When I Started Programming
40 Tips that will change your coding skills forever
How Much Programming do I need in Data Science
Is programming a must for applying Data Science
The Powerful Differences Between Good and Great Programmers
Tips for Writing Self Documenting Code
Top 10 Coding Mistakes Made by Data Scientists
Using Notepad plus plus for Version Control
Ways to Learn Data Science:
6 Data Science Certificates To Level Up Your Career
Programming Skills A Complete Roadmap for Learning Data Science Part 1
Data Analysis A Complete Roadmap for Learning Data Science Part 2
Maths and Statistics A Complete Roadmap for Learning Data Science Part 3
How to Learn Data Science for Free
How To Learn Data Science If Youre Broke
The Art of Learning Data Science
The Fastest Way to Learn Data Science
Top 9 Data Science certifications to know about in 2020
Workflow:
3 Tips to Improving Your Data Science Workflow
A Data Science Workflow
Data Science Workflow
Data Science in General Main Folder:
20 Core Data Science Concepts for Beginners
24 Ultimate Data Science Machine Learning Projects To Boost Your Knowledge and Skills and can be accessed freely
A long term Data Science roadmap which WONT help you become an expert in only several months
Common data science pitfalls and how to avoid them
How to Build a Data Science Portfolio
How to sharpen your data instincts
What is the key skill that the best data scientists have
The Things You Need to Consider for Your Coding Portfolio
Top 5 Free Resources for Learning Data Science
Top 5 tech skills data scientists need, and how to learn them
Top 10 Nice To Have Data Science Libraries
What 70 percent of Data Science Learners Do Wrong
What on earth is data science
Data Visualization:
Colors:
Another Post About Colours for Data Visualisation Part 1 Data Types
Another Post About Colours for Data Visualisation Part 2 Colour Schemes
Another post about colours for data visualisation Part 3 DIY Palettes
The Importance Of Color In Data Visualizations
Plotly:
4 Reasons Why Im Choosing Plotly as My Main Visualization Library
How To Create a Plotly Visualization And Embed It On Websites
Introducing Plotly Express
Its 2019 Make Your Data Visualizations Interactive with Plotly
Plotly Experiments Scatterplots
Plotly dot py 4 point 0 is here Offline Only Express First Displayable Anywhere
Seaborn:
Create basic graph visualizations with SeaBorn
Data Visualisation Using Seaborn
Data Visualization Main folder:
3 Simple Reasons Why You Should First Visualize Data Before Doing Anything Else
4 Parameter Query Functions to Make your Data Visualization Interactive
5 Amazing Tips for Data Visualization
5 Quick and Easy Data Visualizations in Python with Code
5 Ways to Lie With Charts
9 Data Visualization Techniques You Should Learn in Python
9 Data Visualization Tools That You Cannot Miss in 2019
Advanced Visualization for Data Scientists with Matplotlib
An Introduction to Making Scientific Publication Plots with Python
Best Data Visualization Techniques for small and large data
Choosing one of many Python visualization tools
Designing Charts and Graphs: How to Choose the Right Data Visualization Types
Creating Beautiful Maps with Python Beyond the defaults
Data Visualization and Exploration using Pandas Only
Data Visualization 101: How to Choose the Right Chart or Graph for Your Data
Data Visualization Examples: A Look Into Modern Visual Innovation
Data Visualization for Artificial Intelligence and Vice Versa
Effectively Using Matplotlib
Effectively visualize data across time to tell better stories
Everything you need to know about Scatter Plots for Data Visualisation
Full Stack Visualizations For Complex Solutions For Data Scientists
How Can Beginners Design Cool Data Visualizations
How Do You Tell A Story With Data Visualization
Interpreting Data through Visualization with Python Matplotlib
Introduction to Data Visualization in Python How to make graphs using Matplotlib Pandas and Seaborn
Matplotlib Tutorial: Python Plotting
Python Plotting With Matplotlib Guide
Taking Data Visualization to Another Level
The Next Level of Data Visualization in Python
The Simple Yet Practical Data Visualization Codes
The Ultimate Technical Skill in Data Visualization for Data Scientists
How to create stunning visualizations using python from scratch
No More Basic Plots Please
Exploratory Data Science:
3 best practices for exploratory data visualizations
A Starter Pack to Exploratory Data Analysis with Python pandas seaborn and scikit learn
Exploratory Data Analysis EDA techniques for Kaggle competition beginners
Exploratory Data Analysis Made Easy Using Pandas Profiling
Exploratory Data Analysis with Pandas Profiling
Exploratory Data Analysis: A Practical Guide and Template for Structured Data
The Data Science Method DSM Exploratory Data Analysis
Top 5 Functions for Exploratory Data Analysis with Pandas
Image classifier:
Handwriting Number Recognizer:
Handwriting number recognizer with Flutter and Tensorflow part I
Handwriting number recognizer with Flutter and Tensorflow part II
Handwriting number recognizer with Flutter and Tensorflow part III
Handwriting number recognizer with Flutter and Tensorflow part IV
Handwriting number recognizer with Flutter and Tensorflow part V
Object detection:
Object detection via color based image segmentation using python
Object Detection with 10 lines of code
Train Object Detection AI with 6 lines of code
Image classifier main folder:
A single function to streamline image classification with Keras
Create your first Image Recognition Classifier using CNN Keras and Tensorflow backend
How to create a simple Image Classifier
NSFW Image Detector Using Create ML Core ML and Vision
Jupyter Notebook:
Basics and Keyboard Commands:
Back to basics Jupyter notebooks
In this notebook we cover some basics on Jupyter and its usage
Jupyter Notebook main folder:
4 Awesome Tips for Enhancing Jupyter Notebooks
Advanced Jupyter Notebooks A Tutorial
Boosting Your Jupyter Notebook Productivity
Bringing the best out of Jupyter Notebooks for Data Science
From Google Sheet to your Jupyter Notebook
How to create buttons in Jupyter
Interactive Controls in Jupyter Notebooks
Interactive spreadsheets in Jupyter
Jupyter is now a full fledged IDE
Jupyter Lab Evolution of the Jupyter Notebook
Jupyter Notebook Best Practices for Data Science
Jupyter notebooks tips and tricks
Jupyter Notebook Tricks for Data Science that Enhance your efficiency
Jupyter Superpower Interactive Visualization Combo with Python
Markdown in Jupyter Notebook
Productivity tips for Jupyter Python
Set Your Jupyter Notebook up Right with this Extension
Three Jupyter Notebook Extensions That Minimize Distractions
Version Control for Jupyter Notebook
Why Jupyter is data scientists computational notebook of choice
Kaggle competitions:
Generating Titles for Kaggle Kernels with LSTM Small Deep Learning Project with PyTorch
How a team of deep learning newbies came 3rd place in a kaggle contest Classifying images of oil palm plantations using fast ai
How to Participate in a Kaggle Competition with Zero Code
My secret sauce to be in top 2 percent of a kaggle competition
What my first Silver Medal taught me about Text Classification and Kaggle in general
Winning Model Documentation Guidelines
Machine Learning:
Algorithms:
A Tour of The Top 10 Algorithms for Machine Learning Newbies
Algorithm designs optimized machine-learning models up to 200 times faster than traditional methods
Algorithmic Trading Bot Python
Key Algorithms and Statistical Models for Aspiring Data Scientists
Ten Machine Learning Algorithms You Should Know to Become a Data Scientist
The 5 Feature Selection Algorithms every Data Scientist should know
The Fundamental Algorithms of Data Science
The Fundamental Algorithms of Data Science Part 2 Logistic Regression
TOP 10 Machine Learning Algorithms
XGBoost Algorithm Long May She Reign
Deep Learning:
5 reasons to choose PyTorch for deep learning
A newbies guide to build your own deep learning box
A Visual Intuition For Regularization in Deep Learning
Deep Learning Algorithms The Complete Guide
Deep Learning and Machine Learning Models Visualization
Deep Learning Framework Power Scores 2018
Deep learning isnt hard anymore
Deep learning Saving rainforests with TensorFlow
How to design deep learning models with sparse inputs in Tensorflow Keras
How to Develop Competence With Deep Learning for Computer Vision
How To Tag Any Image Using Deep Learning
Make deep learning faster and simpler
Medical Image Analysis with Deep Learning
Neural ODEs breakdown of another deep learning breakthrough
Which Deep Learning Framework is Growing Fastest
Estimating Uncertainty in Machine Learning:
Estimating Uncertainty in Machine Learning Models Part 1
Estimating Uncertainty in Machine Learning Models Part 2
Estimating Uncertainty in Machine Learning Models Part 3
Machine Learning main folder:
Construct a Decision Tree and How to Deal with Overfitting
Beyond CUDA GPU Accelerated Python for Machine Learning on Cross Vendor Graphics Cards Made Simple
4 easy steps to improve your machine learning code performance
4 Machine Learning Techniques with Python
10 Machine Learning Methods that Every Data Scientist Should Know
10 Most Popular Machine Learning Software Tools in 2020 updated
10 Must Try Open Source Tools for Machine Learning
20 Popular Machine Learning Metrics Part 1 Classification Regression Evaluation Metrics
A Beginners Guide to Automated Machine Learning and AI
A brief overview of Automatic Machine Learning solutions AutoML
A Guide to Decision Trees for Machine Learning and Data Science
A Machine Learning Guide for Average Humans
Deep learning vs machine learning a simple way to understand the difference
Absolute Beginning into Machine Learning
Architecting a Machine Learning Pipeline
Azure machine learning
Best Public Datasets for Machine Learning and Data Science Sources and Advice on the Choice
Compare which Machine Learning Model performs Better
Create a complete Machine learning web application using React and Flask
Data Manipulation for Machine Learning with Pandas
Dealing with the Lack of Data in Machine Learning
End To End Guide For Machine Learning Project
Essential libraries for Machine Learning in Python
Financial Machine Learning Part 0 Bars
Financial Machine Learning Part 1 Labels
Four machine learning tricks you should have known to win the Data Science Olympics 2019
Getting Started With Machine Learning
How to correctly select a sample from a huge dataset in machine learning
How to create a machine learning dataset from scratch
How to deliver on Machine Learning projects
How to get started with Machine Learning in about 10 minutes
I had no idea how to build a Machine Learning Pipeline But heres what I figured
Interpretable Machine Learning
Introduction to Machine Learning Top Down Approach
Is your Machine Learning Model Biased
Learning Machine Learning vs Learning Data Science
Machine Learning Basics with the K Nearest Neighbors Algorithm
Machine learning classification the success of Kickstarter tech projects
Machine learning Is the emperor wearing clothes
A Machine Learning Model to Detect Malware Variants
Machine Learning vs Traditional Programming
Machine Learning vs Statistics
Machine Learning Workflows
Machine Learning Perfection always starts with mistakes
Maximum Likelihood Estimation VS Maximum A Posterior
No Machine Learning is not just glorified Statistics
Rules of Machine Learning
The ABCs of Machine Learning
The Problem Of Overfitting And How To Resolve It
Three steps for a successful machine learning project
Top 5 Machine Learning Projects for Beginners
Top Sources For Machine Learning Datasets
Transforming Skewed Data for Machine Learning
Understanding the 3 most common loss functions for Machine Learning Regression
Which machine learning model to use
Why use Machine Learning Instead of Traditional Statistics
Why were writing machine learning infrastructure in Go not Python
Natural Language Processing:
NLP Basics Measuring The Linguistic Complexity of Text
What is Natural Language Processing
Writing Linguistic Rules for Natural Language Processing
Neural Networking:
A Comprehensive Guide to Convolutional Neural Networks the ELI5 way
Build a simple Neural Network with TensorFlow js
Building a Convolutional Neural Network CNN in Keras
Convolutional Neural Networks: A Python Tutorial Using TensorFlow and Keras
Convolutional Neural Networks: Training an Image Classifier with Keras
Convolutional Neural Networks Python Tutorial TensorFlow Eager API
How to build your first Neural Network to predict house prices with Keras
How to build your own Neural Network from scratch in Python
Introducing Neural Structured Learning in TensorFlow
Introduction to Multilayer Neural Networks with TensorFlows Keras API
Neural Network for Satellite Data Classification Using Tensorflow in Python
Tutorial on Graph Neural Networks for Computer Vision and Beyond
Anisotropic, Dynamic, Spectral and Multiscale Filters Defined on Graphs
Useful Plots to Diagnose your Neural Network
Visualizing The Non-linearity of Neural Networks
What is Keras? The deep neural network API explained
Writing your first Neural Net in less than 30 lines of code with Keras
Orange:
Orange save model in python script
Program R:
Exploratory Data Analysis in R:
Exploratory Data Analysis in R for beginners Part 1
Exploratory Data Analysis in R for beginners Part 2
Program R Main Folder:
7 Visualizations You Should Learn in R
8 Useful R Packages for Data Science You Arent Using But Should
Data Science Project Template for R
From R vs Python to R and Python
How I used Python and R to analyze and predict Medical Appointment show ups
How to learn to program in R for free Quartz
How to run Python in R
Installing R using Powershell
Python vs R Choosing the Best Tool for AI ML and Data Science
Query Generation in R
R or Python Why not both Using Anaconda Python within R with reticulate
R Studio Shortcuts and Tips
R vs Python Whats The Difference
Running your R script in Docker
Super Dark IDE Theme R Studio Inverted Color
Synthesising Multiple Linked Data Sets and Sequences in R
The bootstrap The Swiss army knife of any data scientist
Top R Packages for Data Cleaning
Python:
Dealing with Files in Python:
Automate These 3 Boring Excel Tasks with Python
Creating PDF Files with Python
Creating Presentations with Python
Invigorate Excel with Python
How to store digital files in a database in Python
Knowing these You Can Cover 99 percent of File Operations in Python
The easy way to work with CSV JSON and XML in Python
Features:
3 Neglected Features in Python 3 That Everyone Should Be Using
4 Hidden Python Features that Beginners should Know
5 Advanced Features of Python and How to Use Them
Awesome New Python 3 point 8 Features
New Features in Python 3 point 9
Take a Look at the Awesome New Features Coming in Python 3 point 9
Numpy:
Is NumPy really faster than Python
Loading NumPy arrays from disk mmap vs Zarr HDF5
Why Should We Use NumPy
What is npy files and why you should use them
Why You Should Start Using npy Files More Often
Other libraries:
5 Obscure Python Libraries Every Data Scientist Should Know
5 Underrated Python Libraries to Use in Your Next Data Science Project
Lesser Known Python Libraries for Data Science
Python Libraries for Interpretable Machine Learning
Top Python Libraries Numpy and Pandas
Pandas:
Selecting Subsets of Data in Pandas:
Selecting Subsets of Data in Pandas Part 1
Selecting Subsets of Data in Pandas Part 2
Selecting Subsets of Data in Pandas Part 3
Selecting Subsets of Data in Pandas Part 4
Pandas Main Folder:
3 steps to a clean dataset with Pandas
9 pandas visualizations techniques for effective data analysis
10 Python Pandas tricks that make your work more efficient
10 Things You Didnt Know About Pandas
12 Amazing Pandas and NumPy Functions
A Beginners Guide to Optimizing Pandas Code for Speed
A Complete Pandas Guide
A Comprehensive Guide to Pandas Advanced Features in 20 Minutes
A Guide to Pandas and Matplotlib for Data Exploration
Become a Pro at Pandas Pythons data manipulation Library
Combining Pandas DataFrames The easy way
Did You Know Pandas Can Do So Much
Effective Data Filtering in Pandas Using loc
Fast Flexible Easy and Intuitive How to Speed Up Your Pandas Projects
Fast subsets of large datasets with Pandas and SQLite
Get faster pandas with Modin even on your laptops
Heres how to make Pandas Iteration 150x Faster
How to create Pandas Pivot Table and Crosstab
How to Speed up Pandas by 4x with one line of code
Minimally Sufficient Pandas
My Python Pandas Cheat Sheet
One Word of Code to Stop Using Pandas So Slowly
Pandarallel A simple and efficient tool to parallelize your pandas computation on all your CPUs
Pandas analytics server
Pandas and SQL together a Premier League and Player Scouting Example
Pandas Cheat Sheet
Pandas Groupby and Data Handling Tips FIFA Player Data
Pandas in the Premier League
Pandas presentation tips I wish I knew earlier
Pandas profiling and exploratory data analysis with line one of code
Pandas Profiling To Boost Exploratory Data Analysis
Pandas Tutorial 1 Pandas Basics Reading Data Files DataFrames Data Selection
Pandas Tutorial 2 Aggregation and Grouping
Pandas Tutorial 3 Important Data Formatting Methods merge sort reset index fillna
Python Pandas Tutorial Getting Started With DataFrames
Quick dive into Pandas for Data Science
Stop using df dot iterrows
The Easy Way to Extend Pandas API
The Top Five Most Useful Commands in Pandas
Tips for Selecting Columns in a DataFrame
Top 3 Pandas Functions You Dont Know About Probably
Top features of Pandas 1 point 0
Transform Reality with Pandas
Why And How To Use Merge With Pandas in Python
Why and How to Use Pandas with Large Data
20 Great Pandas Tricks For Data Science
10 Pandas methods that helped me replace Microsoft Excel with Python
How to change semi structured text into a Pandas dataframe
Progress Bars:
How to Use Progress Bars in Python
Progress Bars in Python and pandas
Speed Python Up:
10x Faster Parallel Python Without Python Multiprocessing
Are your Python programs running slow Heres how you can make them 7x faster
How to put that GPU to good use with Python
PyPy Faster Python With Minimal Effort
Ten Tricks To Speed Up Your Python Codes
Tips and Tricks:
7 Python Mistakes Data Scientists Should Avoid
8 Advanced Python Tricks Used by Seasoned Programmers
10 Python Tips and Tricks You Should Learn Today
10 simple Python tips to speed up your data analysis
30 Magical Python Tricks to Write Better Code
A few useful tips on how to practice Python
Bookmark this if you are new to Python especially if you self learn Python
Good and Bad Practices of Coding in Python
Python for Data Science 8 Concepts You May Have Forgotten
Python tricks 101 what every new programmer should know
Python Tricks for Keeping Track of Your Data
Top 3 Python Functions You Dont Know About Probably
Top 10 Magic Commands in Python to Boost your Productivity
Top Python Tips and Tricks
Python main folder:
3 Advanced Python Functions for Data Scientists
8 Advanced Python List Techniques You Should Know
8 Python Iteration Skills That Data Scientists Shouldnt Miss Out
9 Skills That Separate Beginners From Intermediate Python Programmers
11 Must Read Blogs for Python Developers
A Data Scientist Should Know At Least This Much Python OOP
A gentle intro to Dash development
A New Python Package How to detect an Anomaly
An introduction to regex using Python
Bite Sized Python Recipes
Everything About Python Beginner To Advanced
I Thought I Was Mastering Python Until I Discovered These Tricks
Learn Enough Python to be Useful argparse
Merging Dictionaries in Python 3 point 9
Python backdoor attacks and how to prevent them
Python in Visual Studio
Python programming languages top uses tools Developers reveal their choices
Pythons Advantages and Disadvantages Summarized
Road to become a Python Ninja Handling Exceptions
The Python Package Dreamteam
Understanding Python Virtual Environments
Use logzero for simple logging in Python
What exactly can you do with Python Here are Pythons 3 main applications
What Is the Walrus Operator in Python
Understand your Python code with this open source visualization tool
SQL:
4 SQL Tips for Data Scientists and Data Engineers
Comparing Python and SQL for Building Data Pipelines
SQL Practical Details Cheat Sheet for Data Analysis
Ten SQL Concepts You Should Know for Data Science Interviews
The Last SQL Guide for Data Analysis Youll Ever Need
Statistics:
A beginners guide to Linear Regression in Python with Scikit Learn
Basic Statistics Every Data Scientist Should Know
Data Science Bayes theorem
P values Explained By Data Scientist
Probability concepts explained Maximum likelihood estimation
Running Chi Square Tests with Die Roll Data in Python
Statistics is the Grammar of Data Science Part 1
Statistics is the Grammar of Data Science Part 2
Statistics is the Grammar of Data Science Part 3
Statistics is the Grammar of Data Science Part 4
Statistics is the Grammar of Data Science Part 5
The 5 Basic Statistics Concepts Data Scientists Need to Know
The 10 Statistical Techniques Data Scientists Need to Master
The Actual Difference Between Statistics and Machine Learning
10 Statistical Concepts You Should Know For Data Science Interviews
Tensorflow:
3 ways to create a Machine Learning model with Keras and TensorFlow 2 Sequential Functional and Model Subclassing
9 Things You Should Know About TensorFlow
Beginners guide for TensorFlow The basics of Googles machine learning library
Benchmarking Transformers PyTorch and TensorFlow
Demystifying Tensorflow Time Series Local Linear Trend
Exploring TensorFlow Quantum Googles New Framework for Creating Quantum Machine Learning Models
From TensorFlow to PyTorch
Getting Started With Bounding Box Regression In TensorFlow
Hyperparameter Optimization with Scikit Learn Scikit Opt and Keras
Introducing TensorFlow Addons
Practical Coding in TensorFlow 2 0
DeepLearning AI TensorFlow Developer Professional Certificate
TensorFlow is dead long live TensorFlow
TensorFlow or Keras Which one should I learn
TensorFlow js machine learning for the web and beyond
What is TensorFlow The machine learning library explained
tensorflow plus dalex or how to explain a TensorFlow model
Webscraping:
A beginners guide to web scraping with Python and Scrapy
How To Scrape Web Pages with Beautiful Soup and Python 3
How to build a URL crawler to map a website using Python
How to Build a Web Scraper With Python Step-by-Step Guide
How to not get caught while web scraping
How to Scrape a Tidy Dataset for Analysis
Youtube:
Top 13 YouTube Channels to Learn Data Science
12 Best YouTube Channels to Learn Data Science in 2020
The most impressive Youtube Channels for you to Learn AI Machine Learning and Data Science
These are some of the best Youtube channels where you can learn PowerBI and Data Analytics for free
Data Science Articles Main folder:
Top 15 Websites for Data Scientists to Follow in 2021
GitHub Actions For the Win
5 Ways to Detect Outliers Anomalies That Every Data Scientist Should Know Python Code
10 Python image manipulation tools
25 Machine Learning Startups To Watch In 2019
A couple tricks for using spaCy at scale
A Gentle Introduction to Exploratory Data Analysis
A Step by Step Guide to Making Sales Dashboards
An Implementation and Explanation of the Random Forest in Python
Analysis of car accidents in Barcelona using Pandas Matplotlib and Folium
An overview of Principal Component Analysis
Artificial Intelligence vs Machine Learning
Automated Feature Engineering for Time Series Data
Basic Feature Engineering With Time Series Data in Python
Become Queen Bee for a Day Using Pythons Built in Data Types
Bias Variance Tradeoff Explained
Cheat Codes to Better Visualisations with Plotly Express
Comparing Column Values in Different Excel Files using Pandas
Cookiecutter Data Science Organize your Projects Atom and Jupyter
Cookiecutter Data Science VC Edition Documentation
Creating Powerful Animated Visualizations in Tableau
Cross Validation Why and How
Data Science Has Become About Lending False Credibility To Decisions Weve Already Made
Data Science Minimum 10 Essential Skills You Need to Know to Start Doing Data Science
Data Science MOOCs are too Superficial
Data Science with Optimus Part 1 Intro
Data Science with Optimus Part 2 Setting your DataOps Environment
Decision Tree In Python
Decision Trees An Intuitive Introduction
Difference between type and isinstance in Python
Do you really need a data scientist
Dynamic Meta Embeddings in Keras
Easy Data Analysis Visualization and Modeling using Datasist PART 1
Easy Data Analysis Visualization and Modeling using Datasist PART 2
EfficientDet Scalable and Efficient Object Detection
Exploring your data with just 1 line of Python
Extreme Rare Event Classification A Straight Forward Solution For a Real World Dataset
Getting Started with Text Vectorization
Helping Santa plan with Mixed Integer Programming MIP
Here Are 11 Console Commands Every Developer Should Know
How I replicated an 86 million project in 57 lines of code
How Much Does It Cost to Hire a Data Scientist Hiring Upwork
How PyTorch lets you build and experiment with a neural net
How to analyze log data with Python and Apache Spark
How to Automatically Import Your Favorite Libraries into IPython or a Jupyter Notebook
How to build a bot to automate your mindless tasks using Python and Google BigQuery
How to Build a Simple Machine Learning web app in python
How To Check If a List Is Empty in Python
How to draw insights from cryptocurrencies with machine learning
How To List Every File in a Directory in Python
How to save money with python
Intuitively How Can We Understand Different Classification Algorithms Principles
Is Your Data Center Ready for Machine Learning Hardware
Its Necessary to Combine Batch Norm and Skip Connections
Kaggle vs Colab Faceoff Which Free GPU Provider is Tops
Learn the fundamentals of a good developer mindset in 15 minutes
Logistic Regression The good parts
Machine Learning Project Planning
Multi Class Text Classification with Scikit Learn
Multithreading in Python for Finance
Nvidia GPUs for data science analytics and distributed machine learning using Python with Dask
Objects Counting by Estimating a Density Map With Convolutional Neural Networks
Oktoberfest Quick analysis using Pandas Matplotlib and Plotly
Open Science Open Source and R
Opening Black Boxes How to leverage Explainable Machine Learning
Programming languages: Python developers reveal their favorite tools
PyTorch Lightning 071 Release and Venture Funding
PyViz Simplifying the Data Visualisation process in Python
Randomly Wired Neural Networks
Read Text from Image with One Line of Python Code
Reviewing Python Visualization Packages
Rolling Window Regression a Simple Approach for Time Series Next value Predictions
Software Engineering for Data Scientists
Speed Up Your Exploratory Data Analysis With Pandas Profiling
Stop Using Square Bracket Notation to Get a Dictionarys Value in Python
Strategies for Addressing Class Imbalance
Text Preprocessing in Python Steps Tools and Examples
The 5 Feature Selection Algorithms every Data Scientist should know
The Difference Between Artificial Intelligence Machine Learning and Deep Learning
The Four Levels of Analytics Maturity
The Googles 7 steps of Machine Learning in practice a TensorFlow example for structured data
The Magic Behind One Line Expressions in Python
The relationship between perplexity and entropy in NLP
The Remarkable World of Recommender Systems
Top 10 Predictive Analytics Tools By Category
Understanding Input Output shapes in Convolution Neural Network Keras
Using Python to Get Robinhood Data
Whats the difference between data science machine learning and artificial intelligence
Whats the difference between machine learning statistics and data mining

Bookmarks/Favorites

The following websites are saved as links on a web page stored locally on my computer. That page serves the same function as the "Favorites" you create in Microsoft Edge.

Data Science:

Kaggle
Stack Overflow
Chris Albon
SAS Studio
Visual tutorial education
Tensorflow.org
Anaconda tensorflow user guide
Airtable an Excel alternative compatible with pandas
google codelabs
Learn Tensorflow
developers google
Notepad++ user manual
Phase AI

Git Related:

GitHub
Gitroyalty.com
Gitroyalty intro
Gitter — Where developers come to talk.

Course Related:

edX Online courses
Datacamp.com
https://course.fast.ai

Work Related:

Data Science Central
Toptal - Hire Freelance Talent from the Top 3%
Upwork - Hire the Right Freelancer

Data Science Books:

Fundamentals of Data Visualization
Data Science at the Command Line
Neural Networks and Deep Learning
Probabilistic Programming and Bayesian Methods for Hackers
Learn Python Break Python
How to Think Like a Computer Scientist: Learning with Python 3
Natural Language Processing with Python
Welcome to Python for you and me
Python Practice Book
A First Course in Linear Algebra
Elementary Applied Topology
Probabilistic Models in the Study of Language
Apache CouchDB (Database)
SQL for Web Nerds
Artificial Intelligence: Foundations of Computational Agents
Deep Learning Book
Learning from Data book assist
Interpretable machine learning
Green Tea Press Free Books

Learning Statistics with R
R for Data Science - O'Reilly book
Advanced R
The R Manuals
Ecological Models and Data in R
R by example

Python Websites:

SciPy.org — SciPy.org
Decision Tree Regressor
Scikit-learn
Scikit-learn tutorial
Real Python
Python.org
pep8.org
Practical Business Python
Geopandas.org
Pytorch
jupyter.org
Numpy.org
Pandas.pydata.org

Computers and Technology:

Acrobat family
Computer Electronics Recycling - Recycling Xperts
Free Hosting - Cloud Hosted with cPanel and full PHP Support
Free Online Learning at GCFLearnFree
GoDaddy
Hackerone.com
How to Install PHP on Windows — SitePoint
Installation - FUDforum Wiki
LastPass
LastPass - One Time Passwords
Learn to code Codecademy
Mail.com
Notepad++ Home
PC Flank Make sure you're protected on all sides.
PC Magazine
php Hypertext Preprocessor
PL-SQL Tutorial - PL-SQL programming made easy
sitemaps.org - Home
The W3C CSS Validation Service
The W3C Markup Validation Service
thesitewizard.com
W3Schools Online Web Tutorials
Yubico Trust the Net with YubiKey Strong Two-Factor Authentication
Online file conversion
regex101.com regular expressions 101 testing website
regular-expressions.info/
Softwaretested.com

Books

This section lists the books I own and have access to.

The following are screenshots of books I own in my Kindle account. Books that are blacked out are personal and/or confidential:
kindle1
kindle2
kindle3
kindle4
kindle5
kindle6
kindle7

Algorithms:
Algorithm Design by Kleinberg, Jon, Tardos, Eva
Algorithms for Reinforcement Learning by Szepesvári, Csaba
Introduction to Algorithms 3rd Edition by Cormen, Thomas H.
Computer Vision: Algorithms and Applications by Szeliski, Richard
Big Data:
Apache CouchDB Release 2.3.1
Big Data Now 2012 Edition by O’Reilly Media, Inc.
Hadoop tutorial by Tutorials point
Data Mining:
Data Mining and Analysis by Zaki, Mohammed J., Meira Jr., Wagner
Data Mining Practical Machine Learning Tools and Techniques 2nd Ed by Ian H. Witten, Eibe Frank
Data Mining Practical Machine Learning Tools and Techniques 3rd Ed by Ian H. Witten, Eibe Frank
A Programmer’s Guide to Data Mining: The Ancient Art of the Numerati by Ron Zacharski
Mining of Massive Datasets by Jure Leskovec, Anand Rajaraman, Jeffrey D. Ullman
Data Science in General:
Data Science for Business by Foster Provost and Tom Fawcett
Data Science from Scratch by Joel Grus
Introduction to Data Science by Jeffrey Stanton
Data Scientists at Work by Sebastian Gutierrez
Development Workflows for Data Scientists by Ciara Byrne
Foundations of Data Science by Avrim Blum, John Hopcroft, and Ravindran Kannan
Learning from Data by Abu-Mostafa, Yaser
The Data Analytics Handbook by Brian Liou
Think Like a Data Scientist by Brian Godsey
Data Visualization:
Fundamentals of Data Visualization by Claus O. Wilke
Graph Databases 2nd ed. by Ian Robinson, Jim Webber and Emil Eifrem
Interactive Data Visualization for the Web 2nd ed. by Scott Murray
Storytelling with Data by Cole Nussbaumer Knaflic
The Big Book of Dashboards by Steve Wexler, Jeffrey Shaffer, Andy Cotgreave
Deep Learning:
Deep Learning for the Life Sciences by Bharath Ramsundar, Peter Eastman, Patrick Walters, and Vijay Pande
Generative Deep Learning by David Foster
Deep Learning with JavaScript by Shanqing Cai, Stanley Bileschi, Eric D. Nielsen, with François Chollet
Deep Learning with Python by François Chollet
Deep Learning Cookbook by Douwe Osinga
Introduction to Deep Learning by Eugene Charniak
Grokking Deep Learning by Andrew W. Trask
Git:
Git Essentials by Ferdinando Santacroce
Git Tutorial by Tutorials point
Machine Learning:
A Course in Machine Learning by Hal Daumé III
A First Encounter with Machine Learning by Max Welling
An Introduction to Machine Learning Interpretability by Patrick Hall and Navdeep Gill
The Hundred-Page Machine Learning Book by Andriy Burkov
Free and Open Machine Learning Documentation by Maikel
Hands-On Machine Learning with Scikit-Learn and TensorFlow by Aurélien Géron
Introduction to Machine Learning by Amnon Shashua
Introduction to Machine Learning by Alex Smola and S.V.N. Vishwanathan
Machine Learning, Neural and Statistical Classification by D. Michie, D.J. Spiegelhalter, C.C. Taylor
Python Machine Learning Projects by Lisa Tagliaferri, Michelle Morales, Ellie Birbeck, and Alvin Wan
Machine Learning with Python Cookbook by Chris Albon
Machine Learning Yearning by Andrew Ng
Python Machine Learning by Sebastian Raschka
Real-World Active Learning by Ted Cuzzillo
The LION Way by Roberto Battiti, Mauro Brunato
TinyML by Pete Warden and Daniel Situnayake
Understanding Machine Learning: From Theory to Algorithms by Shai Shalev-Shwartz and Shai Ben-David
Math:
Elementary Differential Equations by William F. Trench
Linear Algebra Done Right by Sheldon Axler
Linear Algebra by David Cherney, Tom Denton, Rohit Thomas and Andrew Waldron
Mining the Social Web:
Mining the Social Web 3rd Ed. by Matthew A. Russell and Mikhail Klassen
Social Media Mining by Reza Zafarani, Mohammad Ali Abbasi, Huan Liu
Pandas:
Effective Pandas by Tom Augspurger
Pandas 1.x Cookbook by Matt Harrison and Theodore Petrou
Predictive Analytics:
Applied Predictive Modeling by Max Kuhn, Kjell Johnson
The Path to Predictive Analytics and Machine Learning by Conor Doherty, Steven Camiña, Kevin White and Gary Orenstein
Python:
Python for Data Analysis by Wes McKinney
A Whirlwind Tour of Python by Jake VanderPlas
Black Hat Python by Justin Seitz
A Byte of Python by Swaroop C H
Fluent Python by Luciano Ramalho
Head First Python by Paul Barry
Learning Python 5th Ed. by Mark Lutz
Modeling and Simulation in Python by Allen B. Downey
Python Anti-Patterns by QuantifiedCode
Python Cookbook by David Beazley and Brian K. Jones
Python Crash Course by Eric Matthes
Python Data Science Handbook by Jake VanderPlas
Python for Everybody by Dr. Charles R. Severance
Python for Informatics by Charles Severance
Python Tricks: The Book by Dan Bader
Python Tutorial by tutorialspoint.com
Test-Driven Development with Python by Harry J.W. Percival
The Hitchhiker’s Guide to Python by Kenneth Reitz and Tanya Schlusser
Think Python by Allen Downey
Pytorch:
Programming PyTorch for Deep Learning by Ian Pointer
Pytorch by tutorialspoint.com
R Program:
A Little Book of R For Time Series by Avril Coghlan
An Introduction to Statistical Learning with Applications in R by Gareth James, Daniela Witten, Trevor Hastie, Robert Tibshirani
Practical Regression and Anova using R by Julian J. Faraway
The R Inferno by Patrick Burns
R Programming for Data Science by Roger D. Peng
Spatial Epidemiology Notes Applications and Vignettes in R by Charles DiMaggio, PhD
Regular Expressions:
Mastering Regular Expressions 3rd Ed. by Jeffrey E. F. Friedl
Regular Expressions Cookbook 2nd ed by Jan Goyvaerts and Steven Levithan
Reinforcement Learning:
Deep Reinforcement Learning Hands-On Second Edition by Maxim Lapan
Reinforcement Learning: An Introduction Second edition, in progress by Richard S. Sutton and Andrew G. Barto
SQL:
Extracting Data from NoSQL Databases by Petter Näsholm
NoSQL Databases by Christof Strauch
SQL Cookbook by Usman Qadri
SQL Tutorial by tutorialspoint.com
Statistics:
100 Statistical Tests by Gopal K. Kanji
Bayesian Reasoning and Machine Learning by David Barber
Introduction to Probability by Charles M. Grinstead, J. Laurie Snell
Introductory Statistics with Randomization and Simulation by David M Diez, Christopher D Barr, Mine Çetinkaya-Rundel
Learning statistics with R: A tutorial for psychology students and other beginners by Danielle Navarro
Probability and Statistics Cookbook by Matthias Vallentin
The Elements of Statistical Learning by Trevor Hastie, Robert Tibshirani, Jerome Friedman
Think Bayes by Allen B. Downey
Think Stats by Allen B. Downey
Tableau:
33 Ways to Tableau by Ryan Sleeper
Practical Tableau by Ryan Sleeper
Tensorflow:
TensorFlow for Deep Learning by Bharath Ramsundar and Reza Bosagh Zadeh
Think books:
Think Complexity by Allen B. Downey
Think DSP by Allen B. Downey
Think Perl 6 by Laurent Rosenfeld, with Allen B. Downey
Books Main Folder:
A First Course in Design and Analysis of Experiments by Gary W. Oehlert
Automate the Boring Stuff with Python by Al Sweigart
Applied Text Analysis with Python by Benjamin Bengfort, Rebecca Bilbro, and Tony Ojeda
Artificial Intelligence by Stuart J. Russell and Peter Norvig
Cassandra query language by tutorialspoint.com
Disruptive Possibilities How Big Data Changes Everything by Jeffrey Needham
Gaussian Processes for Machine Learning by C. E. Rasmussen and C. K. I. Williams
Information Theory, Inference, and Learning Algorithms by David J.C. MacKay
KB – Neural Data Mining with Python sources by Roberto Bello
Data-Intensive Text Processing with MapReduce by Jimmy Lin and Chris Dyer
Natural Language Processing with Python by Steven Bird, Ewan Klein, and Edward Loper
Physical Modeling in MATLAB by Allen B. Downey
Practical Time Series Analysis by Aileen Nielsen
Programming Pig by Alan Gates
Programming Computer Vision with Python by Jan Erik Solem
Text Analytics with Python by Dipanjan Sarkar
The Little MongoDB Book by Karl Seguin

This site was last updated on August 11, 2021

Copyright © 2020 All Rights Reserved.

A special thank you to the website thesitewizard.com for the assistance in the creation of this website.

Another special thank you to Natassha Selvaraj for her article, which inspired this webpage.