
MemoryError: Unable to allocate 168. GiB for an array with shape (76821, 542, 542) and data type float64 #17

Open
Al-Dailami opened this issue Feb 22, 2021 · 6 comments


@Al-Dailami

loading training set
100%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 76821/76821 [02:21<00:00, 543.88it/s]
Traceback (most recent call last):
  File "train.py", line 39, in <module>
    train_adj, train_mask = preprocess_adj(train_adj)
  File "~/TextING/utils.py", line 153, in preprocess_adj
    return np.array(list(adj)), mask  # coo_to_tuple(sparse.COO(np.array(list(adj)))), mask
MemoryError: Unable to allocate 168. GiB for an array with shape (76821, 542, 542) and data type float64
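(As a quick sanity check, the requested allocation is exactly a dense float64 array of that shape: 76821 × 542 × 542 elements at 8 bytes each is about 168 GiB.)

n_graphs, max_nodes = 76821, 542
print(n_graphs * max_nodes * max_nodes * 8 / 2**30)  # float64 = 8 bytes -> ~168.1 GiB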

@Al-Dailami
Author

Hello

Can you help me fix this problem?

@Magicat128
Collaborator

Hi @Al-Dailami

Which dataset are you using? You may try processing the training samples in batches and concatenating the results with NumPy (see the sketch below).
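Here is a minimal, runnable version of that idea, using a simplified stand-in for the repo's preprocess_adj (the pad-to-max-size-and-mask behavior is inferred from the error above, not copied from the code):

import numpy as np

def preprocess_adj_batch(adj_list):
    # Pad a small batch of variable-size adjacency matrices to a common
    # size and build a node mask, instead of densifying all 76k graphs at once.
    max_n = max(a.shape[0] for a in adj_list)
    batch = np.zeros((len(adj_list), max_n, max_n), dtype=np.float32)
    mask = np.zeros((len(adj_list), max_n, 1), dtype=np.float32)
    for i, a in enumerate(adj_list):
        n = a.shape[0]
        batch[i, :n, :n] = a
        mask[i, :n, 0] = 1.0
    return batch, mask

# Toy usage: five graphs with 3..7 nodes each
graphs = [np.eye(n, dtype=np.float32) for n in range(3, 8)]
b, m = preprocess_adj_batch(graphs)
print(b.shape, m.shape)  # (5, 7, 7) (5, 7, 1)

Using float32 instead of float64 also halves the memory per batch.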

@Al-Dailami
Author

Thanks a lot for your reply.

I'm working with a dataset that contains around 500,000 records of short texts. Could you please advise how to modify the code to process the data in batches?

Thanks a lot in advance for your valuable help.

@Al-Dailami
Author

Al-Dailami commented Feb 24, 2021

Hello,
I have modified the trainer to process the data batch by batch.
Is this the right way?

# Construct feed dictionary

b_train_adj, b_train_mask = preprocess_adj(train_adj[idx])
b_train_feature = preprocess_features(train_feature[idx])
feed_dict = construct_feed_dict(b_train_feature, b_train_adj, b_train_mask, train_y[idx], placeholders)
feed_dict.update({placeholders['dropout']: FLAGS.dropout})

@Magicat128
Collaborator

> Hello,
> I have modified the trainer to process the data batch by batch.
> Is this the right way?
>
> # Construct feed dictionary
> b_train_adj, b_train_mask = preprocess_adj(train_adj[idx])
> b_train_feature = preprocess_features(train_feature[idx])
> feed_dict = construct_feed_dict(b_train_feature, b_train_adj, train_mask, train_y[idx], placeholders)
> feed_dict.update({placeholders['dropout']: FLAGS.dropout})

@Al-Dailami
Yes, that's the right way to do it. And it should be your b_train_mask in the feed_dict rather than train_mask :)
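For completeness, a sketch of those lines folded into a batch loop (batch_size, sess, and model.opt_op are assumptions about the surrounding TF1-style trainer, not code from the repo):

for start in range(0, len(train_y), batch_size):
    idx = slice(start, start + batch_size)
    b_train_adj, b_train_mask = preprocess_adj(train_adj[idx])
    b_train_feature = preprocess_features(train_feature[idx])
    feed_dict = construct_feed_dict(b_train_feature, b_train_adj,
                                    b_train_mask,  # the batch mask, not train_mask
                                    train_y[idx], placeholders)
    feed_dict.update({placeholders['dropout']: FLAGS.dropout})
    sess.run(model.opt_op, feed_dict=feed_dict)  # assumed training op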

@bp20200202

> Hello,
> I have modified the trainer to process the data batch by batch.
> Is this the right way?
>
> # Construct feed dictionary
> b_train_adj, b_train_mask = preprocess_adj(train_adj[idx])
> b_train_feature = preprocess_features(train_feature[idx])
> feed_dict = construct_feed_dict(b_train_feature, b_train_adj, b_train_mask, train_y[idx], placeholders)
> feed_dict.update({placeholders['dropout']: FLAGS.dropout})

Hello, I changed the code as you suggested, but I am still getting a MemoryError. Have you ever experienced this situation?
