Describe the bug
I load a RoBERTa model that I trained on a custom dataset through the ClassificationModel() class. When I call model.predict() on text data, the call never returns and keeps running indefinitely, even for a small dataset (as few as 20 data points). This seems to be a multiprocessing problem, because loading the model prints this warning:
UserWarning: use_multiprocessing automatically disabled as xlmroberta fails when using multiprocessing for feature conversion.
To Reproduce
from simpletransformers.classification import (ClassificationModel,)
import pandas as pd
model_name = "xlmroberta"
model_destination = "model_directory"
model = ClassificationModel(model_name, model_destination)
df = pd.read_pickle(r'pickle_file.pkl')
predictions, raw_outputs = model.predict(df.text[0:100].to_list())
Expected behavior
The model should return predictions for these data points quickly instead of running indefinitely. The problem did not occur with fewer data points, but as soon as I tried 100 data points (see the screenshot), execution never finished. The individual sentences are not long: the mean length is 141 characters.
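Since the hang seems to depend on how many data points are passed at once, one way to narrow it down is to call predict on small slices and time each call. The following is only a minimal sketch, not part of simpletransformers: `predict_in_batches` and `fake_predict` are hypothetical names, and the sketch assumes only that the predict function takes a list of strings and returns a `(predictions, raw_outputs)` pair, as `model.predict()` does in the snippet above.

```python
import time


def predict_in_batches(predict_fn, texts, batch_size=20):
    """Call predict_fn on consecutive slices of texts and time each call.

    predict_fn is assumed to take a list of strings and return a
    (predictions, raw_outputs) pair. Returns the concatenated predictions
    and a list of (start_index, batch_length, seconds) tuples.
    """
    predictions = []
    timings = []
    for start in range(0, len(texts), batch_size):
        batch = list(texts[start:start + batch_size])
        t0 = time.perf_counter()
        batch_predictions, _raw_outputs = predict_fn(batch)
        elapsed = time.perf_counter() - t0
        predictions.extend(batch_predictions)
        timings.append((start, len(batch), elapsed))
    return predictions, timings


def fake_predict(batch):
    """Hypothetical stand-in for model.predict, used only to run the sketch."""
    return [len(text) % 2 for text in batch], [[0.0, 0.0] for _ in batch]


if __name__ == "__main__":
    sample_texts = ["a" * i for i in range(45)]
    preds, timings = predict_in_batches(fake_predict, sample_texts)
    for start, size, seconds in timings:
        print(f"batch at {start} ({size} texts): {seconds:.4f}s")
```

With the model from the report, the call would be `predict_in_batches(model.predict, df.text[0:100].to_list())`. If every slice of 20 returns but a single call with 100 texts does not, that points at something size-dependent rather than at a particular input sentence.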
Screenshots
Desktop (please complete the following information):