Repository navigation

#

big-data-analytics

Easy Machine Learning is a general-purpose dataflow-based system for easing the process of applying machine learning algorithms to real world tasks.

Java
1978
1 年前

PySpark-Tutorial provides basic algorithms using PySpark

Jupyter Notebook
1218
3 个月前

vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)

C++
881
1 个月前

可视化大屏解决方案, 提供一套可视化编辑引擎, 助力个人或企业轻松定制自己的可视化大屏应用.

TypeScript
641
4 个月前

Use CH-UI to work with your data from Click House self-hosted with a user-friendly interface. CH-UI is a modern and feature-rich user interface for ClickHouse databases. It offers an intuitive platform for querying ClickHouse databases, executing queries, and visualizing metrics about your instance.

TypeScript
340
1 个月前

A multi-cloud framework for big data analytics and embarrassingly parallel jobs, that provides an universal API for building parallel applications in the cloud ☁️🚀

Python
330
2 个月前

A data-driven method combining symbolic regression and compressed sensing for accurate & interpretable models.

Fortran
274
1 个月前

Graph Sampling is a python package containing various approaches which samples the original graph according to different sample sizes.

Python
161
4 年前

The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.

Scala
143
1 年前

Data cleaning, pre-processing, and Analytics on a million movies using Spark and Scala.

Scala
94
4 年前

This is about learning courses in Coursera. All the answers given written by myself

HTML
92
5 年前

I have built the computer vision models in 3 different ways addressing different personas, because not all companies will have a resolute data science team. quality-control manufacturing big-data-analytics jupyter-notebook cognitive services industry solutions

Jupyter Notebook
81
4 年前

Bucketize an image based on exhaust data and AI generated data. industry-solutions azure azure machine learning services computer-vision big data big data analytics machine learning image recognition manufacturing quality control cognitive services

Python
79
6 年前

Course covers big data fundamentals, processes, technologies, platform ecosystem, and management for practical application development.

Jupyter Notebook
57
1 年前

Egis - a handy Ruby interface for AWS Athena

Ruby
42
3 年前