Page cover

Pythonic Resources

The list below is a collection of Python packages used for visualization and several ML tasks compiled by Thomas L Vincent.arrow-up-right

Data Visualization

General Purpose (Tabular) Machine Learning

  • annoyarrow-up-right - approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk.

  • imbalanced-learnarrow-up-right - a Python package to tackle the curse of imbalanced datasets in Machine Learning. The documentation for this library can be found herearrow-up-right.

  • hummingbirdarrow-up-right - a library for compiling trained traditional ML models into tensor computations. The documentation for this library can be found herearrow-up-right.

  • lifetimesarrow-up-right - a Python library to help model customer behavior and measure Customer Lifetime Value. The documentation for this library can be found herearrow-up-right.

  • metric-learnarrow-up-right - efficient Python implementations of several popular supervised and weakly-supervised metric learning algorithms. The documentation for this library can be found herearrow-up-right.

  • milkarrow-up-right - Machine learning toolkit in Python with a strong emphasis on speed and low memory usage. The documentation for this library can be found herearrow-up-right. Its focus is on supervised classification with several classifiers available: SVMs (based on libsvm), k-NN, random forests, decision trees.

  • pyBrainarrow-up-right - a Python library to develop and implement neural networks. The documentation for this library can be found [here]http://pybrain.org/docs/index.html).

  • pycaretarrow-up-right - a low-code machine learning library in Python that automates machine learning workflows. The documentation for this library can be found herearrow-up-right.

  • pymc3arrow-up-right - a probabilistic programming library for Python that allows users to build Bayesian models with a simple Python API. The documentation for this library can be found herearrow-up-right.

  • scikit-learnarrow-up-right - Multi-purpose Machine Learning library in Python. The documentation for this library can be found herearrow-up-right.

  • statsmodelarrow-up-right - a Python module that provides classes and functions for the estimation of many different statistical models, as well as for conducting statistical tests, and statistical data exploration. The documentation for this library can be found herearrow-up-right.

  • XGBoostarrow-up-right - XGBoost is an optimized distributed gradient boosting library designed to be highly efficient, flexible and portable. The documentation for this library can be found herearrow-up-right.

ML Explanability and Feature Interpretation

  • eli5arrow-up-right - a Python library for debugging/inspecting machine learning classifiers and explaining their predictions. The documentation for this library can be found herearrow-up-right.

  • limearrow-up-right - a Python library to help explain the predictions of any machine learning classifier. A more thorough explanation of the methodology is available herearrow-up-right.

  • omniXAIarrow-up-right - a Python machine-learning library for explainable AI (XAI), offering omni-way explainable AI and interpretable machine learning capabilities. The documentation for this library can be found herearrow-up-right.

  • shaparrow-up-right - a game theoretic approach to explain the output of any machine learning model.

  • yellowbrickarrow-up-right - a Python library that provides a suite of visual analysis and diagnostic tools to facilitate machine learning model selection. The documentation for this library can be found herearrow-up-right.

Hyper-parameter Optimization

Time Series

  • Auto_TSarrow-up-right - Automatically build ARIMA, SARIMAX, VAR, FB Prophet and XGBoost Models on Time Series data sets with a Single Line of Code. The documentation for this library can be found herearrow-up-right

  • dartsarrow-up-right - a Python library for easy manipulation and forecasting of time series. It contains a variety of models, from classics such as ARIMA to deep neural networks. The documentation for this library can be found herearrow-up-right.

  • luminolarrow-up-right - a lightweight python library for time series data analysis. The two major functionalities it supports are anomaly detection and correlation. The documentation for this library can be found herearrow-up-right.

  • Prophetarrow-up-right - a library for forecasting time series data based on an additive model where non-linear trends are fit with yearly, weekly, and daily seasonality, plus holiday effects. The documentation for this library can be found herearrow-up-right.

  • sktimearrow-up-right - provides an easy-to-use, flexible and modular open-source framework for a wide range of time series machine learning tasks. The documentation for this library can be found herearrow-up-right.

  • statsforecastarrow-up-right - lightning fast forecasting with statistical and econometric models. The documentation for this library can be found herearrow-up-right.

  • tsfresharrow-up-right - automates the extraction of relevant features from time series data. The documentation for this library can be found herearrow-up-right.

  • pyodarrow-up-right - a Python toolkit that provides access to a wide range of outlier detection algorithms for detecting outliers in multivariate data. The documentation for this library can be found herearrow-up-right.

  • pytsarrow-up-right - a Python package dedicated to time series classification. It aims to make time series classification easily accessible by providing preprocessing and utility tools, and implementations of several time series classification algorithms. The documentation for this library can be found herearrow-up-right.

Survival Analysis

Causal Inference

  • Causal MLarrow-up-right - provides a suite of uplift modeling and causal inference methods that allows user to estimate the Conditional Average Treatment Effect (CATE) or Individual Treatment Effect (ITE) from experimental or observational data. The documentation for this library can be found herearrow-up-right.

  • doWhyarrow-up-right - An end-to-end library for causal inference. The documentation for this library can be found herearrow-up-right.

  • EconMLarrow-up-right - applies machine learning techniques to estimate individualized causal responses from observational or experimental data. The suite of estimation methods provided in EconML represents the latest advances in causal machine learning. The documentation for this library can be found herearrow-up-right.

  • scikit-upliftarrow-up-right - an uplift modeling python package that provides fast sklearn-style models implementation, evaluation metrics and visualization tools. The documentation for this library can be found herearrow-up-right.

Recommendation \& Ranking

Natural Language Processing

Computer Vision

  • NiLearnarrow-up-right - makes it easy to use many advanced machine learning, pattern recognition and multivariate statistical techniques on neuroimaging data. The documentation for this library can be found herearrow-up-right.

  • OpenCVarrow-up-right - an open-source library that includes several hundreds of computer vision algorithms. The documentation for this library can be found herearrow-up-right.

Miscalleneous

Last updated