You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello, I tried running your generate_data.py code and found that the ACE2005 dataset requires preprocessing. However, when I attempted to preprocess it, I noticed that the code tries to read the doc.txt files from the split folder. But the original data doesn’t contain any split content, nor did I find that the preprocessing code generates the files under the split folder. Could you please clarify whether this is a missing part of the code, or if I haven’t followed the process correctly? I would greatly appreciate it if you could respond to this issue. Thank you very much!
The text was updated successfully, but these errors were encountered:
Thank you very much for your help. Your suggestions resolved my issue with the split in the ACE2005 dataset. However, I still have some questions and would greatly appreciate further clarification. It seems that the link you provided for the E3C dataset is no longer valid, but I found the E3C dataset on the GitHub site in the form of test.txt and train.txt files. Unfortunately, there is no dev file, and it does not correspond to the tsv format mentioned in the config file. There is also no processing file for conversion. In the Casie processing files, there is no split functionality. Can I perform the split myself? Another issue concerns the DIANN data, which consists of txt files that need processing, but I might not know how to handle this, similar to the E3C dataset, which also needs to be converted into tsv files. Once again, I would be very grateful if you could address these questions. Thank you very much!
Hello, I tried running your
generate_data.py
code and found that the ACE2005 dataset requires preprocessing. However, when I attempted to preprocess it, I noticed that the code tries to read thedoc.txt
files from thesplit
folder. But the original data doesn’t contain anysplit
content, nor did I find that the preprocessing code generates the files under thesplit
folder. Could you please clarify whether this is a missing part of the code, or if I haven’t followed the process correctly? I would greatly appreciate it if you could respond to this issue. Thank you very much!The text was updated successfully, but these errors were encountered: