APS Failure at Scania Trucks

Introduction

The dataset consists of data collected from heavy Scania trucks in everyday usage. The system in focus is the Air Pressure system (APS) which generates pressurised air that are utilized in various functions in a truck, such as braking and gear changes. The datasets' positive class consists of component failures for a specific component of the APS system. The negative class consists of trucks with failures for components not related to the APS. The data consists of a subset of all available data, selected by experts.

Abstract

The datasets' positive class consists of component failures for a specific component of the APS system. The negative class consists of trucks with failures for components not related to the APS.

Problem Statement

To predict APS failure accurately and minimizes the cost of failures

Challenge metric

Cost-metric of miss-classification:

Predicted class |    True class   |
                |   pos  |   neg  |
-----------------------------------
     pos        |    -   | Cost_1 |
-----------------------------------
     neg        | Cost_2 |    -   |
-----------------------------------
Cost_1 = 10 and cost_2 = 500

Predicted class |    True class   |
                |   pos  |   neg  |
-----------------------------------
     pos        |   TP   |   FP   |
-----------------------------------
     neg        |   FN   |   TN   |
-----------------------------------
Total_cost = 10*FP + 500*FN

The total cost of a prediction model the sum of "Cost_1" multiplied by the number of Instances with type 1 failure and "Cost_2" with the number of instances with type 2 failure, resulting in a "Total_cost".

 In this case Cost_1 refers to the cost that an unnessecary 
 check needs to be done by an mechanic at an workshop, while 
 Cost_2 refer to the cost of missing a faulty truck, 
 which may cause a breakdown.

 Total_cost = Cost_1*No_Instances + Cost_2*No_Instances.

Number of Instances:

 The training set contains 60000 examples in total in which 
 59000 belong to the negative class and 1000 positive class. 
 The test set contains 16000 examples.

Number of Attributes: 171

Attribute Information:

The attribute names of the data have been anonymized for proprietary reasons. It consists of both single numerical counters and histograms consisting of bins with different conditions. Typically the histograms have open-ended conditions at each end. For example if we measuring the ambient temperature "T" then the histogram could be defined with 4 bins where:

bin 1 collect values for temperature T < -20 bin 2 collect values for temperature T >= -20 and T < 0
bin 3 collect values for temperature T >= 0 and T < 20
bin 4 collect values for temperature T > 20

| b1 | b2 | b3 | b4 |

     -20     0      20

The attributes are as follows: class, then anonymized operational data. The operational data have an identifier and a bin id, like "Identifier_Bin". In total there are 171 attributes, of which 7 are histogram variabels. Missing values are denoted by "na".

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
APS Failure Scania Truck.ipynb		APS Failure Scania Truck.ipynb
README.md		README.md
aps_failure_test_set.csv		aps_failure_test_set.csv
model_performance.xlsx		model_performance.xlsx

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

APS Failure at Scania Trucks

Introduction

Abstract

Problem Statement

Challenge metric

Number of Instances:

Number of Attributes: 171

Attribute Information:

| b1 | b2 | b3 | b4 |

About

Releases

Packages

Languages

Lala86/Scania_Truck

Folders and files

Latest commit

History

Repository files navigation

APS Failure at Scania Trucks

Introduction

Abstract

Problem Statement

Challenge metric

Number of Instances:

Number of Attributes: 171

Attribute Information:

| b1 | b2 | b3 | b4 |

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages