LSTM is extremely slow ( 2 hours ) #470

lynxionxs · 2019-10-20T07:25:44Z

Hi. I'm trianing an LSTM network that takes 2 hours to train 1000 training data with only 2000 iterations. Why is it so very slow?

// learns if string is like a date

// get training data
const trainingData = [
    {"input":"33 minutes ago","output":"yes"},
    {"input":"20 hours ago","output":"yes"},
    {"input":"May 7 at 13:42 AM","output":"yes"},
    {"input":"Feb 21 at 8:43 AM","output":"yes"},
    {"input":"Jul 22, 2012 ","output":"yes"},
    {"input":"Apr 14, 2018 ","output":"yes"}
    // 1000 total...
]

const network = new brain.recurrent.LSTM();

// create configuration for training
const config = {
    iterations: 2000,
    log: true,
    logPeriod: 200,
    layers: [10],
    log(detail) {
        console.log(detail);
    }
};

network.train(trainingData, config);

const output = network.run('Apr 6, 2014');

console.log(`Is like a date: ${(output == 'yes') ? 'YES' : 'NO' }`);

tymmesyde · 2019-10-22T05:41:11Z

Try to replace your outputs with 0 or 1 instead of 'no' / 'yes' since you only have two type of outputs
Maybe it will speed up the process a little

tymmesyde · 2019-10-22T06:16:04Z

I didn't pay more attention to your trainingSet in my previous answer, but I see that you are trying to train your network with dates in the form of strings.
I suggest you to take a different approach by normalizing your data:

Maybe try to see what the similarities are in your data and use them in your trainingSet as inputs instead of a literal string
Example: hasSpaces, hasNumber, hasStrings, hasCommas, hasYear, ...
(I suggest you add more of those)

Make a short script before, reviewing all your data by checking if it meets the above criteria.
(e.g. if hasSpaces, then your first input value should be 1, and so on)

Then fill your trainingSet like that:

{
   input: [1, 0, 1, 1, 0], output: [0],
   input: [1, 1, 1, 1, 1], output: [1]
}

And when you want to use run, go through to same process

lynxionxs · 2019-10-22T07:44:30Z

@tymmesyde That makes sense now. Thanks

Shubbair · 2019-12-21T11:38:17Z

@tymmesyde thank you so much

ninjaferrari90 · 2020-01-12T14:03:49Z

The training takes too long i.e. somewhere around 14 hours or even more..
My data set is having different output for different input strings..

Is there a way i can reduce my training time? Although i am storing the trained results in JSON and using that while retrieving the output..

Using Node with brain.js version 2.0.0-alpha.11

const trainingData = [
{"input":"How are you","output":"very well"},
{"input":"How have you been","output":"very well"},
{"input":"welcome to new york","output":"thanks"},
{"input":"welcome to our city","output":"thanks"},
{"input":"welcome to usa","output":"thanks"},
{"input":"Lets catchup today","output":"ofcourse"},
{"input":"Lets meet today","output":"ofcourse"}
...... many more....
//large data set with different responses for different scenarios..
]

const network = new brain.recurrent.LSTM();

// create configuration for training
const config = {
iterations: 10000,
log: true,
};

network.train(trainingData, config);

Paulsy10 · 2021-08-30T00:32:50Z

I am having the exact same problem as above. Can someone please help me?

I am getting different outputs for different inputs... and the bot is so... unintelligent.
I just have like 4 text input and outputs to train and it takes like 30 minutes.

tymmesyde · 2021-08-30T12:59:28Z

@ninjaferrari90 @Paulsy10 , this is not a bot, this is a neural net, a component to build your bot.
Read my answer above, you cannot feed plain text data to the neural net, you need to normalize your dataset first.

Paulsy10 · 2021-08-30T21:02:07Z

I am still a bit confused on what you mean by "normalizing"... Do you mean to use numbers instead of strings?

tymmesyde · 2021-08-30T21:03:46Z

I am still a bit confused on what you mean by "normalizing"... Do you mean to use numbers instead of strings?

Read the issue and my comment in response to this issue, this is explained in details.

Paulsy10 · 2021-08-30T21:14:25Z

I am still a bit confused on what you mean by "normalizing"... Do you mean to use numbers instead of strings?

Read the issue and my comment in response to this issue, this is explained in details.
I have a different situation where I am not using dates and just training it to chat. I don't see any similiarties with my text.

tymmesyde · 2021-08-30T21:33:29Z

This is the same issue, you need to normalize your dataset.
It isn't meant to be fed with text data. However you can give it numeric values, ranging from 0 to 1.
You need to find a way to translate those text values into readable ones for the net.

In your case if you just want to make a simple answers bot, you only need to create a dictionary of strings like so:

Questions:

How are you = 0
How have you been = 1
...

Answers:

very well = 0
thanks = 1
...

Then normalize your data by using your dictionary to translate these values to a [0,1] range.

{
    input: [0], // How are you
    output: [0] // very well
},
{
    input: [1], // How have you been
    output: [0] // very well
}
...

More on the matter here:
https://github.com/cazala/synaptic/wiki/Normalization-101
https://github.com/adadgio/neural-data-normalizer
cazala/synaptic#72

Hope it helps.

lynxionxs changed the title ~~LSTM is extremely slow~~ LSTM is extremely slow ( 2 hours ) Oct 20, 2019

lynxionxs closed this as completed Oct 22, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LSTM is extremely slow ( 2 hours ) #470

LSTM is extremely slow ( 2 hours ) #470

lynxionxs commented Oct 20, 2019

tymmesyde commented Oct 22, 2019

tymmesyde commented Oct 22, 2019

lynxionxs commented Oct 22, 2019

Shubbair commented Dec 21, 2019

ninjaferrari90 commented Jan 12, 2020

Paulsy10 commented Aug 30, 2021 •

edited

Loading

tymmesyde commented Aug 30, 2021

Paulsy10 commented Aug 30, 2021

tymmesyde commented Aug 30, 2021

Paulsy10 commented Aug 30, 2021

tymmesyde commented Aug 30, 2021 •

edited

Loading

LSTM is extremely slow ( 2 hours ) #470

LSTM is extremely slow ( 2 hours ) #470

Comments

lynxionxs commented Oct 20, 2019

tymmesyde commented Oct 22, 2019

tymmesyde commented Oct 22, 2019

lynxionxs commented Oct 22, 2019

Shubbair commented Dec 21, 2019

ninjaferrari90 commented Jan 12, 2020

Paulsy10 commented Aug 30, 2021 • edited Loading

tymmesyde commented Aug 30, 2021

Paulsy10 commented Aug 30, 2021

tymmesyde commented Aug 30, 2021

Paulsy10 commented Aug 30, 2021

tymmesyde commented Aug 30, 2021 • edited Loading

Paulsy10 commented Aug 30, 2021 •

edited

Loading

tymmesyde commented Aug 30, 2021 •

edited

Loading