Deep Learning: Training, Validation and Test Set Concepts


Training, Validation and Test Data

Example:

(A). We have data on 16 data items: their attributes and class labels. RANDOMLY divide them into 8 for training, 4 for validation and 4 for testing.
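The random split in step (A) can be sketched as follows. This is a minimal illustration assuming NumPy; the fixed seed is an assumption added only so the split is reproducible.

```python
import numpy as np

# Shuffle the 16 item numbers and split them 8 / 4 / 4
# into training, validation and test sets (step (A)).
rng = np.random.default_rng(0)    # assumed seed, for reproducibility
items = np.arange(1, 17)          # item numbers 1..16
rng.shuffle(items)                # random permutation, in place

train_items = items[:8]           # 8 items for training
val_items   = items[8:12]         # 4 items for validation
test_items  = items[12:]          # 4 items for testing
```

Every item lands in exactly one of the three sets, which is all the split needs to guarantee.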

Set          Item No.   True Class
Training      1          0
              2          0
              3          1
              4          1
              5          1
              6          1
              7          0
              8          0
Validation    9          0
             10          0
             11          1
             12          0
Test         13          0
             14          0
             15          1
             16          1

(The d attribute values are KNOWN FOR ALL DATA ITEMS; they are not listed here.)

(B). Next, suppose we develop three classification models A, B, C from the training data. Let the training errors of these models be as shown below (recall that the models do not necessarily give perfect results on the training data, nor are they required to).

Classification results on the training data (d attributes ALL KNOWN):

Item No.   True Class   Model A   Model B   Model C
 1          0            0         1         1
 2          0            0         0         0
 3          1            0         1         0
 4          1            1         0         1
 5          1            0         0         0

(C). Next, use the three models A, B, C to classify each item in the validation set based on its attribute values. Recall that we do know their true labels as well. Suppose that, based on the resulting validation errors, model C performs best; we would therefore select model C.
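Step (C) amounts to computing each model's validation error and keeping the argmin. A minimal sketch follows; the per-model predictions are made-up placeholders (the source's validation table is not shown), chosen to reproduce the 50% and 25% validation errors in the summary in (F). The true labels of items 9-12 come from the table in (A).

```python
# Validation labels for items 9-12, from the table in step (A).
val_true = [0, 0, 1, 0]

# Assumed predictions, for illustration only.
predictions = {
    "A": [1, 0, 1, 1],   # 2 mistakes out of 4 -> 50% validation error
    "C": [0, 0, 1, 1],   # 1 mistake  out of 4 -> 25% validation error
}

def error_rate(y_true, y_pred):
    """Fraction of items whose predicted class differs from the true class."""
    return sum(t != p for t, p in zip(y_true, y_pred)) / len(y_true)

errors = {m: error_rate(val_true, p) for m, p in predictions.items()}
best = min(errors, key=errors.get)   # model with the lowest validation error
print(best, errors)                  # model C wins here
```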

(D). Now use model C to determine class values for each data point in the test set. We do so by substituting the (known) attribute values into classification model C. Again, recall that we know the true label of each of these data items, so we can compare the values obtained from the classification model with the true labels to determine the classification error on the test set. Suppose we get the following results.
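Scoring the selected model on the test set can be sketched as below. The true labels of items 13-16 come from the table in (A); the model C predictions are assumed placeholders chosen so that exactly one of the four is wrong, matching the 25% test error quoted in (E).

```python
# Test labels for items 13-16, from the table in step (A).
test_true = [0, 0, 1, 1]
# Assumed model C outputs on the test items (placeholders).
c_pred    = [0, 0, 1, 0]

mistakes = sum(t != p for t, p in zip(test_true, c_pred))
test_error = mistakes / len(test_true)
print(f"test error: {test_error:.0%}")   # one mistake in four -> 25%
```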

(E). Based on the above, an estimate of the generalization error is 25%.

What this means is that if we use Model C to classify future items for which only the attributes will be known, not the class labels, we are likely to make incorrect classifications about 25% of the time.

(F). A summary of the above is as follows:

Model   Training   Validation   Test
A        25         50           ---
B        50         ---          ---
C        25         25           25

(Errors in %; the test error of 25% for model C is the generalization error estimate from (E).)

Cross Validation

If available data are limited, we employ Cross Validation (CV). In this approach, the data are randomly divided into k sets of (almost) equal size. Training is done on (k-1) of the sets and the remaining set is used for testing. This process is repeated k times, so that each set serves once as the test set (k-fold CV). The average error over the k repetitions is used as an estimate of the test error.
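The k-fold procedure can be sketched as follows, assuming NumPy. The `evaluate` callback stands in for whatever training-and-scoring routine is used; it is a hypothetical hook, not something from the source.

```python
import numpy as np

def k_fold_cv(n_items, k, evaluate):
    """Average test error over k folds.

    evaluate(train_idx, test_idx) -> test error for that fold
    (a hypothetical callback standing in for train-then-score).
    """
    indices = np.arange(n_items)
    folds = np.array_split(indices, k)   # k nearly equal-sized folds
    errors = []
    for i in range(k):
        test_idx = folds[i]
        train_idx = np.concatenate([folds[j] for j in range(k) if j != i])
        errors.append(evaluate(train_idx, test_idx))
    return sum(errors) / k               # average over the k repetitions

# Usage with a dummy evaluator that reports a constant 25% error:
avg = k_fold_cv(16, 4, lambda train, test: 0.25)
print(avg)   # 0.25
```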

For the special case when k equals the number of data items, so that each test set contains exactly one item, the above is called Leave-One-Out Cross-Validation (LOO-CV).
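In LOO-CV the fold structure is fully determined: with n items there are n folds, each testing on a single item. A sketch of the splits (assuming NumPy; the fit/score step is left as a comment):

```python
import numpy as np

# LOO-CV splits for n items: each item is held out once as a
# single-item test set, while the other n-1 items form the training set.
n = 16
splits = []
for i in range(n):
    test_idx  = np.array([i])                 # one held-out item
    train_idx = np.delete(np.arange(n), i)    # the remaining n-1 items
    splits.append((train_idx, test_idx))
    # ... fit the model on train_idx, score it on test_idx, and
    # average the n single-item errors to estimate the test error ...
```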

EXAMPLE:

Consider the above data consisting of 16 items.

(A). Let k = 4, i.e., 4-fold Cross Validation.

Divide the data into four sets of 4 items each.

Suppose the following set up occurs and the errors obtained are as shown.

                    Set 1         Set 2             Set 3            Set 4
Training items      1-12          1-8, 13-16        1-4, 9-16        5-16
Test items          13-16         9-12              5-8              1-4
Error on test set
(assumed)           25%           35%               28%              32%

The 4-fold CV estimate of the test error is the average over the four sets: (25 + 35 + 28 + 32)/4 = 30%.
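The 4-fold layout above and the averaging of its (assumed) fold errors can be sketched as:

```python
# Items 1-16 with the test sets used above: 13-16, 9-12, 5-8, 1-4.
items = list(range(1, 17))
test_sets   = [items[12:], items[8:12], items[4:8], items[:4]]
fold_errors = [0.25, 0.35, 0.28, 0.32]     # assumed, from the table

for test, err in zip(test_sets, fold_errors):
    train = [i for i in items if i not in test]   # the other 12 items
    assert len(train) == 12 and len(test) == 4

# The CV estimate of the test error is the average fold error.
cv_estimate = sum(fold_errors) / len(fold_errors)
print(f"CV estimate of test error: {cv_estimate:.0%}")   # 30%
```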
