[1904.02101] The Landscape of R Packages for Automated Exploratory Data Analysisopen searchopen navigation menucontact arXivsubscribe to arXiv mailings

The increasing availability of large but noisy data sets with a large number of heterogeneous variables leads to the increasing interest in the automation of common tasks for data analysis. The most time-consuming part of this process is the Exploratory Data Analysis, crucial for better domain understanding, data cleaning, data validation, and feature engineering. There is a growing number of libraries that attempt to automate some of the typical Exploratory Data Analysis tasks to make the search for new insights easier and faster. In this paper, we present a systematic review of existing tools for Automated Exploratory Data Analysis (autoEDA). We explore the features of twelve popular R packages to identify the parts of analysis that can be effectively automated with the current tools and to point out new directions for further autoEDA development.

1 mentions: @y__mattu
Date: 2020/06/27 06:52

Related Entries

Read more Bonfire Data & Science #2 - connpass
0 users, 13 mentions 2020/02/17 02:20
Read more 自然言語処理ナイト - connpass
0 users, 28 mentions 2020/06/15 12:58
Read more ICML2020 因果推論系論文 著者発表会 (オンライン) - connpass
0 users, 50 mentions 2020/06/19 11:21
Read more 再考: お買い得物件を機械学習で見つける方法 / Rethink: Method to Find Cheap Rental Houses by Machine Learning - Speaker D...
0 users, 2 mentions 2020/08/01 09:51
Read more 第88回R勉強会@東京(#TokyoR) - connpass
0 users, 10 mentions 2020/09/07 11:23