RoBERTa model Prediction taking Infinite time with no error but Multiprocessing disabled warning #1558

etqadkhan · 2023-11-24T16:57:53Z

Describe the bug
While loading the ClassificationModel() class for RoBERTa model that I have trained on a custom dataset, I am facing an issue where when I am trying to perform model.predict() operation on the text data and even for a small dataset (as less as 20 data points), the model doesn't return the result but keeps running infinitely. Seems to be a problem with multiprocessing as when I load the model the warning says,

UserWarning: use_multiprocessing automatically disabled as xlmroberta fails when using multiprocessing for feature conversion.

To Reproduce

from simpletransformers.classification import (ClassificationModel,)
import pandas as pd
model_name = "xlmroberta"
model_destination = "model_directory"
model = ClassificationModel(model_name, model_destination)
df = pd.read_pickle(r'pickle_file.pkl')
predictions, raw_outputs = model.predict(df.text[0:100].to_list())

Expected behavior
The expected behaviour is that the model should be able to predict for the data points in a smooth manner in a pretty small amount of time and not go into an infinite run. The problem didn't happen when number of data points were less, but as soon as I tried for 100 data points (see the screenshot), it went into a never ending execution. The individual text sentences are not very long with the mean number of characters per sentence being 141 characters.

Screenshots

Desktop (please complete the following information):

OS - Debian 11
Environment - PyTorch 2.0 (with Intel MKL-DNN/MKL) [Vertex AI Workbench env]
GPU - NVDIA T4
Machine Type - 8vCPU 52 GB RAM
simpletransformers==0.64.3
transformers==4.35.2
torch==2.0.0+cu118
CUDA version - 11.8

The text was updated successfully, but these errors were encountered:

etqadkhan mentioned this issue Nov 24, 2023

fixed multiprocessing issue during RoBERTa prediction - Solution to #1558 #1559

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RoBERTa model Prediction taking Infinite time with no error but Multiprocessing disabled warning #1558

RoBERTa model Prediction taking Infinite time with no error but Multiprocessing disabled warning #1558

etqadkhan commented Nov 24, 2023

RoBERTa model Prediction taking Infinite time with no error but Multiprocessing disabled warning #1558

RoBERTa model Prediction taking Infinite time with no error but Multiprocessing disabled warning #1558

Comments

etqadkhan commented Nov 24, 2023