Version: Next

Tabular Anomaly Detection (TAD)

Updated 2024.07.06

What is Tabular Anomaly Detection?

TAD (Machine Learning Anomaly Detection) is a system that automatically detects anomalies in data using machine learning.
This system learns normal data patterns and identifies different patterns as anomalies.
It is effective in detecting new, unseen anomalies, contributing to risk management and stability across various industries.

TAD is useful in the following scenarios:

When you want to detect new anomalies.
When the training data consists only of normal data.
When you want to simplify the pipeline from data preprocessing to model development and deployment for anomaly detection.

TAD can be used for anomaly detection modeling in various domains, including:

AutoML Feature: Automatically finds the optimal model without the need for the user to select and adjust models.
Data Preprocessing: Provides various data preprocessing techniques to improve data quality.
Anomaly Detection: Effectively detects new anomalies based on normal data.
User-Friendliness: Allows users to input a few parameters and execute, creating the desired anomaly detection model for the input data.
Code-Free Modeling: Performs various preprocessing and modeling experiments automatically by inputting parameters in a YAML file.
Scalability: Users can add separate machine learning models to be used along with existing models.

Install ALO. Learn more: Start ALO
Use the following git address to install the content. Learn more: Use AI Contents (Lv.1)
Git URL: https://github.com/mellerikat-aicontents/Tabular-Anomaly-Detection.git
Installation code: git clone https://github.com/mellerikat-aicontents/Tabular-Anomaly-Detection.git solution (run inside the ALO installation folder)

Prepare a CSV file containing columns of the data you want to detect anomalies in.
Each column value should be a float, and if there are empty or NaN values, the corresponding row will be automatically excluded. data.csv

x_col_1 x_col_2 time_col(optional) grouupkey(optional) y_col(optional)
value 1_1 value 1_2 time 1 group1 ok
value 2_1 value2_2 time 2 group2 ok
value 3_1 value3_2 time 3 group1 ng
... ... ... ... ...

x_col_1	x_col_2	time_col(optional)	grouupkey(optional)	y_col(optional)
value 1_1	value 1_2	time 1	group1	ok
value 2_1	value2_2	time 2	group2	ok
value 3_1	value3_2	time 3	group1	ng
...	...	...	...	...

By setting only steps 1 and 2 and running ALO, you can create a TAD model.

=> For more advanced parameter settings to create a model that better fits your data, refer to the link on the right. Learn more: TAD Parameter

You can run it in a terminal or a Jupyter notebook. Learn more: Develop AI Solution
The execution results include a trained model file, prediction results, and performance charts.

TAD Version: 1.0.0, ALO Version: 2.5.2