1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
-
Updated
Apr 23, 2024 - Python
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
Pandas profiling component for Streamlit.
Data Science Feature Engineering and Selection Tutorials
A New Interactive Approach to Learning Data Analysis
Demo from Data Community Bydgoszcz i Toruń, 27.02.2019
Numpy and Pandas are one of the most important building blocks of knowledge to get started in the field of Data Science, Analytics, Machine Learning, Business Intelligence, and Business Analytics. This Tutorial Focuses to help the Beginners to learn the core Concepts of Numpy and Pandas and get started with Machine Learning and Data Science.
In this repository, we would see different available libraries for Exploratory Data Analysis
Jupyter Notebook Templates for quick prototyping of machine learning solutions
Using PyCaret to Predict Apple Stock Prices
Predicting whether or not a person deposits money after a marketing campaign. Gain insights to develop the best strategy in the next marketing campaign
A Python library for day to day data analysis and machine learning. This aims to make data building, cleaning and machine learning much much faster. A library of extension and helper modules for Python's data analysis and machine learning libraries.
Analysis on crime data using pandas
EDA (Exploratory Data Analysis) -1: Loading the Datasets, Data type conversions,Removing duplicate entries, Dropping the column, Renaming the column, Outlier Detection, Missing Values and Imputation (Numerical and Categorical), Scatter plot and Correlation analysis, Transformations, Automatic EDA Methods (Pandas Profiling and Sweetviz).
In statistics, exploratory data analysis is an approach to analyzing data sets to summarize their main characteristics, often with visual methods. A statistical model can be used or not, but primarily EDA is for seeing what the data can tell us beyond the formal modeling or hypothesis testing task
Pandas
Learn about the MBTA V3 API by building queries and exploring the results
Easy way to analyze your files through web-interface
This is the Streamlit web application that allows users to upload a dataset, generate an automated exploratory data analysis (EDA) report using the pandas-profiling library, and and train a machine learning model for regression or classification tasks.
Univariate, Bivariate and Multi-variate Analysis
Add a description, image, and links to the pandas-profiling topic page so that developers can more easily learn about it.
To associate your repository with the pandas-profiling topic, visit your repo's landing page and select "manage topics."