OpenRefine is a free, open source power tool for working with messy data and improving it
Updated Dec 23, 2024 - Java
Always know what to expect from your data.
Open-source low-code data preparation library in Python. Collect, clean, and visualize your data in Python with a few lines of code.
🔥 🔥 🔥 Open Source & AI-driven Data Onboarding Platform: Free flatfile.com alternative
A full pipeline AutoML tool for tabular data
A natural language processing project that performs sentiment analysis by classifying tweets as positive or negative with machine learning models. Covers text mining, text analysis, data analysis, and data visualization.
This repository contains data and code used to get and clean data from https://github.com/CSSEGISandData/COVID-19 and https://www.worldometers.info/coronavirus/
Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team is the first to know and the first to solve with visibility across and down your data estate. Save time with simple, fast data quality test generation and execution. Trust your data, tools, and systems end to end.
An open-source Python package for cleaning raw text data
Data and code for scraping and cleaning data on COVID-19 in India from https://www.mohfw.gov.in/ and https://www.covid19india.org/
Benchmark for bi-level optimization solvers
This repo contains 4 different projects: various machine learning models built for Kaggle competitions, along with exploratory data analysis, data cleaning, data visualization, data munging, feature selection, etc.
Predicts home prices in Bangalore. Built with Flutter, Flask, and Jupyter Notebook.
Code to enable OpenRefine to run as an authenticated web service
This project aims to minimize police response time by detecting weapons in a live CCTV camera feed and alerting the police as soon as any kind of weapon is detected. The project focuses primarily on guns. 🔫💣💻🎥
Worked on a dataset of high-entropy alloys used to design materials for additive manufacturing. Performed data analysis and built machine learning models, including neural networks and gradient boosting, to make predictions useful for the invention of advanced materials.
Distill large-scale web page text