Couldn't fit the data using neural networks in MATLAB (fitnet function)

Hi,
I have been trying to fit data to a nonlinear model using neural networks in MATLAB. I have several sets of data, and my code works for some data sets but not for all of them.
For some data sets I can fit with a good regression coefficient; for others the network gives a constant output (i.e., a regression coefficient of almost 0).
This is the architecture of my neural network: a feedforward network trained with backpropagation, one hidden layer, and a number of hidden neurons that I vary from run to run to compare results.
Can anyone point out what is going wrong in my code, please?
clear;
%% Load data from the Excel file
filename = 'dD.xlsx';
x = xlsread(filename);
p = x(:,2:12);            % 11 input columns
t = x(:,1);               % target column
inputs  = p';
targets = t';
% rng(200,'v4');
rng(0)                    % fix the RNG seed so results are reproducible
%% Create a fitting network
hiddenLayerSize = 5;
net = fitnet(hiddenLayerSize);
%% Set up division of data for training, validation, testing
% net.divideFcn = 'dividetrain'; % no validation or test data
net.divideParam.trainRatio = 100/100;
net.divideParam.valRatio   = 0/100;
net.divideParam.testRatio  = 0/100;
net.trainFcn = 'trainbr';  % Bayesian regularization
% net = configure(net,ptrans,tn);
net.layers{1}.transferFcn = 'logsig';
net.layers{2}.transferFcn = 'purelin';
%% Train the network
[net,tr] = train(net,inputs,targets);
best_epoch = tr.best_epoch;
effective_param = tr.gamk;                         % effective no. of parameters per epoch
effective_no_of_parameters = effective_param(end);
wt_IL   = net.IW{1,1};    % input-to-hidden weights
wt_HL   = net.LW{2,1};    % hidden-to-output weights
bias_IL = net.b{1};
bias_HL = net.b{2};
%% Test the network
outputs = net(inputs);
errors = gsubtract(outputs,targets);
performance = perform(net,targets,outputs)

 Accepted Answer

1. The best approach for us to help is to
a. Apply your code to the MATLAB data set that provides the best example of your problem
help nndatasets
doc nndatasets
b. Since you have 11 inputs and 1 output, consider one of the following
abalone_dataset    8 -> 1 , 4177 samples   Abalone shell rings dataset.
bodyfat_dataset   13 -> 1 ,  252 samples   Body fat percentage dataset.
chemical_dataset   8 -> 1 ,  498 samples   Chemical sensor dataset.
house_dataset     13 -> 1 ,  506 samples   House value dataset.
2. Record the initial RNG state so results can be duplicated
3. Obtain the normalized mean-square error and tabulate the corresponding Rsquare (see Wikipedia) from a double-loop approach to determining the number of hidden nodes (outer loop h = Hmin:dH:Hmax) and the initial random weights (inner loop i = 1:Ntrials)
MSE00 = mean(var(target',1)) % Reference MSE (optimal for constant output)
NMSE = mse(target-output)/MSE00
Rsquare = 1 - NMSE
4. Typically, my goal is to use
i) numel(Hmin:dH:Hmax) = 10; Ntrials = 10
ii) Minimize the number of hidden nodes subject to the constraint that the net models at least 99% of the mean target variance, i.e., Rsquare > 0.99.
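A minimal sketch of that double loop, assuming the inputs and targets from the original post are in the workspace; the values of Hmin, dH, Hmax and Ntrials are illustrative, not prescribed:

```matlab
% Hedged sketch: outer loop over hidden-node counts, inner loop over
% random weight initializations, tabulating Rsquare for each design.
Hmin = 1; dH = 1; Hmax = 10; Ntrials = 10;        % example values
Hvals = Hmin:dH:Hmax;
R2 = zeros(numel(Hvals), Ntrials);
MSE00 = mean(var(targets',1));                    % reference MSE (constant output)
for j = 1:numel(Hvals)
    for i = 1:Ntrials
        rng(i)                                    % record RNG state per trial
        net = fitnet(Hvals(j));
        net.trainFcn = 'trainbr';
        net = train(net, inputs, targets);
        y = net(inputs);
        NMSE = mse(targets - y)/MSE00;
        R2(j,i) = 1 - NMSE;
    end
end
% Then pick the smallest H whose best trial achieves Rsquare > 0.99
```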
5. You wrote that you have varied h and made multiple runs. However, you did not
a. State the size of your data set
b. Tabulate your results (Rsquare is preferred, but NMSE is ok)
6. I have posted scores of examples in both NEWSGROUP and ANSWERS. A useful search combination is
greg fitnet Ntrials
7. The no-overfitting condition Hmax << Hub in my posts is not necessary when regularization (trainbr and/or msereg) is used.
Hope this helps.
Thank you for formally accepting my answer
Greg

11 Comments

I couldn't understand some of your suggestions.
Can you please explain a bit more about "Record the initial RNG state so results can be duplicated"? How do I perform this operation?
As I have little data, I am using my entire dataset as the training set only.
I didn't use any preprocessing commands in the above code. When I tried some preprocessing tools like MAPMINMAX and PROCESSPCA, the results varied (some were good, some were bad).
Can you please explain what preprocessing and postprocessing tools are included in the function FITNET? I am not able to find much information in the manual.
I attached the sizes, the numbers of neurons I used, and the R_square coefficients I got for the different data sets while running the same code.
Thanks for the help
% I couldn't understand some of your suggestions
%
% can you please explain a bit more about "Record the initial RNG state so
% results can be duplicated" how to do perform this operation??
You have already done it by explicitly specifying rng(0)
% As i have less data i am using my entire dataset as training set only.
That is dangerous because then it is hard for you to convince your teacher, sponsor or client that you have an unbiased estimate of performance on new or unseen data.
If this is serious work, you need to go back and get unbiased estimates using a test set. I recommend using 10-fold cross-validation enough times to create a good R2 histogram; the best 30 or more designs can then be used to obtain convincing summary R2 statistic estimates.
Since initial weights are random, you will get some unsatisfactory designs which you should include in the histogram but exclude from the summary statistics.
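One way to sketch that 10-fold idea, assuming the inputs/targets from the original post and the Statistics Toolbox function CVPARTITION; H = 5 is an illustrative choice:

```matlab
% Hedged sketch: 10-fold cross-validation of a fitnet design.
K = 10;
N = size(inputs,2);
cv = cvpartition(N,'KFold',K);
R2 = zeros(K,1);
for k = 1:K
    trnInd = training(cv,k);
    tstInd = test(cv,k);
    net = fitnet(5);
    net.divideFcn = 'dividetrain';            % train on the whole training fold
    net.trainFcn  = 'trainbr';
    net = train(net, inputs(:,trnInd), targets(trnInd));
    ytst = net(inputs(:,tstInd));
    MSE00 = mean(var(targets(tstInd),1));     % reference MSE (constant model)
    R2(k) = 1 - mse(targets(tstInd) - ytst)/MSE00;
end
meanR2 = mean(R2)   % summary estimate of out-of-sample Rsquare
```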
I used slightly modified code on the bodyfat_dataset for H = 16:-1:0:
 H    logsig     tansig
16    0.78952    0.79693
15    0.78939    0.78933
14    0.78923    0.78934
13    0.78924    0.79693
12    0.78951    0.79694
11    0.78927    0.79694
10    0.7895     0.79694
 9    0.78926    0.79695
 8    0.78956    0.79695
 7    0.78957    0.79696
 6    0.78948    0.79697
 5    0.78934    0.79699
 4    0.78958    0.79709
 3    0.77613    0.77828
 2    0.76794    0.76788
 1    0.75763    0.75741
 0   -0.20245    0.74731
It looks like the default tansig with H = 4 is a reasonable choice.
% i didn't use any pre processing commands in the above code. when i tried
% using some pre processing tools like MAPMINMAX and PROCESSPCA but results
% got varied (some results were good, some were bad).
MAPMINMAX is the default. However, I prefer to explicitly apply ZSCORE before training so I can delete or modify outliers; since it is too painful to continually remove MAPMINMAX, I just use both.
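A minimal sketch of that preprocessing step, using ZSCORE from the Statistics Toolbox; the 3-standard-deviation outlier threshold is an illustrative assumption, not a value from this thread:

```matlab
% Hedged sketch: standardize inputs with zscore so outliers can be
% inspected/removed in standardized units; fitnet still applies its
% default mapminmax on top of this ("use both").
xz = zscore(inputs')';             % zscore works down columns, hence the transposes
outlier = any(abs(xz) > 3, 1);     % flag samples beyond 3 standard deviations
xz(:,outlier) = [];
tz = targets(~outlier);
net = fitnet(5);
net = train(net, xz, tz);
```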
In other high-dimensional problems I have found decent input reduction just by using a linear model and STEPWISEFIT. The weaker the model, the better it tends to choose good inputs. Also, the original inputs are kept instead of linear combinations.
% can you please explain me about what are all the pre processing and post
% processing tools are included in the function FITNET..i am not able to
% get much information in MANUAL???
When you type, before and after training, WITHOUT THE ENDING SEMICOLON
net = net
you will see how to get that and other net details. Similarly
tr = tr
will yield training details
% i attached the size, no of neurons i used and R_square coefficients i got
% for different data sets while running the same CODE
OK, will look at it.
% Thanks for the help
Will be looking for my check in the mail (;>)
UH-OH
Having Microsoft Office problems.
Please email a *.txt or *.m copy
Thanks
Greg
Thanks for the reply.
What would happen if we changed the seed in the RNG function, e.g., RNG(0) to RNG(200)?
I read somewhere that, according to Bayesian regularization theory, we can use the entire data set as the training set, so I am using the TRAINBR function with the total dataset as the training set.
I tried my code on the bodyfat dataset and was able to fit the data up to a 0.79 regression coefficient. Where can I confirm my results on the bodyfat dataset?
Thanks
Changing the seed just changes the initial state of the RNG. Therefore you would get different initial weights and a different trn/val/tst split.
Therefore, it is essential for reproducibility of results.
I've been designing NN classifiers since the 1970s. Regardless of what you say or reference, you cannot find a sponsor or client who is willing to part with $$$ unless you can get good performance on nontraining data!
You can confirm your results by comparing with mine.
Greg
PS I used Hmax = 16 because that is when Nw = Ntrneq. As expected, there are no indications of the onset of overtraining. If you are curious, you might want to compare without regularization or validation stopping.
Wait a minute. You wrote
i am able to fit the data upto 0.79 regression coefficient
I think you meant the coefficient of determination Rsquare = 0.79
The regression coefficient is R = sqrt(Rsquare) ~ 0.89
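In MATLAB terms:

```matlab
Rsquare = 0.79;
R = sqrt(Rsquare)   % R is approximately 0.8888
```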
Thanks..
Yes Rsquare is 0.79
I have two final questions
How should we proceed when we have little data and want to fit the model using a neural network?
How do we set parameters like "net.trainParam.goal = 0.01*Ndof*MSEtrn00a/Ntrneq; % R2a >= 0.99", as specified by you in the thread http://in.mathworks.com/matlabcentral/newsreader/view_thread/333427 ?

1. When you don't have enough data for Ntrneq >> Nw, you can use some or all of the following. It is important (for convincing $ponsor$ and client$) to make multiple designs so that you can calculate robust summary statistics from the best 30 or more acceptable designs.

 a. Simulate extra data by creating Gaussian distributions about each data point. It is not unusual to use simulated data for training and validation, then use the original data to get "unbiased" performance estimates.
 b. Cross validation or Bootstrapping
 c. Validation Stopping
 d. Regularization

2. Don't understand your question. Be specific
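For point 1a, a hedged sketch of simulating extra data by Gaussian jitter; the 5% noise scale and the number of copies Nrep are illustrative assumptions:

```matlab
% Hedged sketch: jitter each original point with small Gaussian noise.
Nrep = 10;                            % simulated copies per original point
sx = 0.05*std(inputs,0,2);            % per-input noise scale (example choice)
st = 0.05*std(targets);
xsim = repmat(inputs,1,Nrep) + ...
       bsxfun(@times, sx, randn(size(inputs,1), size(inputs,2)*Nrep));
tsim = repmat(targets,1,Nrep) + st*randn(1, numel(targets)*Nrep);
% Train/validate on (xsim,tsim); reserve the original (inputs,targets)
% for the final "unbiased" performance estimate.
```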

As per my understanding of this thread, you have changed some default parameters of the transfer function (in this case TRAINLM).
e.g.: net.trainParam.goal = 0.01*Ndof*MSEtrn00a/Ntrneq;
How should I change this value when I use a different transfer function like TRAINBR? What is the necessity of changing the default values to specific values?
This change stops training when the training error variance is less than 1% of the original target variance.
You can make it smaller (e.g., by a factor of 2 or 10) if you wish.
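Putting the pieces together, a sketch of how those quantities might be computed before setting the goal; the variable definitions follow the thread's notation, and taking MSEtrn00a as the Bessel-corrected target variance is my assumption:

```matlab
% Hedged sketch: stop TRAINLM training once the training MSE falls below
% roughly 1% of the target variance (adjusted for degrees of freedom).
[I, Ntrn] = size(inputs);             % inputs, targets assumed in workspace
Ntrneq = Ntrn*size(targets,1);        % number of training equations
H = 5;                                % illustrative hidden-node count
Nw = (I+1)*H + (H+1)*1;               % number of weights for an I-H-1 net
Ndof = Ntrneq - Nw;                   % estimation degrees of freedom
MSEtrn00a = mean(var(targets',0));    % adjusted reference MSE (assumption)
net = fitnet(H);
net.trainFcn = 'trainlm';
net.trainParam.goal = 0.01*Ndof*MSEtrn00a/Ntrneq;   % targets R2a >= 0.99
```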
TRAINBR doesn't have a different transfer function.
Thanks for all these valuable comments.


Asked on 12 Mar 2015
Commented on 17 Mar 2015
