Repository navigation
mixed-data
- Website
- Wikipedia
Python library for causal inference and probabilistic modeling.
Repository of a data modeling and analysis tool based on Bayesian networks
ICDM19 - Anomaly Detection / Outlier Detection for Mixed data
Model-based clustering package for mixed data
latentcor is a Python package provides estimation for latent correlation with mixed data types (continuous, binary, truncated and ternary).
More Preformant Gower distance with modern Python tooling
Package for Vector Search over a tabular dataset across multiple columns
IBM Employee Profiling using Clustering
Sentiment analysis using BERT on Hindi-English code-mixed data
In this case study I will be doing Exploratory Data Analytics with the help of a case study on Bank marketing campaign.
Causal discovery from mixed data with missing values.
A capstone project to predict the adoption-speed of listed pets
A Synthetic Data Generator for producing mixed datasets described by relevant, irrelevant, and redundant features.
Reading list of categorical/mixed data analysis
Gower Distance for MATLAB. This repository contains MATLAB implementation of Gower distance calculation for mixed numerical and categorical datasets.
This repository includes the R code used for the project "Mixed-type data clustering: a full factorial benchmarking study on distance-based clustering methods", written by Efthymios Costa. The project is supervised by Dr. Ioanna Papatsouma (Imperial College London) and co-supervised by Professor Alastair Young (Imperial College London).
A simplified algorithm to cluster mixed-type data(numerical and categorical).
A Distance Metric for Clustering Mixed Data Using Graph-Based Feature Influence Balancing Approach.