| ... | @@ -23,7 +23,11 @@ Options: |
... | @@ -23,7 +23,11 @@ Options: |
|
|
--help Show this message and exit.
|
|
--help Show this message and exit.
|
|
|
```
|
|
```
|
|
|
|
|
|
|
|
**outlier threshold** input field is related with the outlier detection for numerical variables of the incoming dataset. The way that the Data Quality Control tool handles the outlier detection of a certain numerical variable, is that first calculates the **mean** and the **standard deviation** based on the valid values of that column and then calculates the upper and the lower limit by the formula: `upper_limit = mean + outlier threshold * standard deviation`, `lower_limit = mean - outlier threshold * standard deviation`. If any value is outside those limits then it is considered as an outlier.
|
|
**outlier threshold**
|
|
|
|
|
|
|
|
This input field is related with the outlier detection for numerical variables of the incoming dataset. The way that the Data Quality Control tool handles the outlier detection of a certain numerical variable, is that first calculates the **mean** and the **standard deviation** based on the valid values of that column and then calculates the upper and the lower limit by the formula:
|
|
|
|
|
|
|
|
`upper_limit = mean + outlier threshold * standard deviation`, `lower_limit = mean - outlier threshold * standard deviation`. If any value is outside those limits then it is considered as an outlier.
|
|
|
|
|
|
|
|
The report file will be saved in the folder where the incoming dataset file is located.
|
|
The report file will be saved in the folder where the incoming dataset file is located.
|
|
|
|
|
|
| ... | | ... | |