Issue with Split File Handling in ACE2005 Preprocessing Code #24

zdhgreat · 2024-10-13T08:24:20Z

Hello, I tried running your generate_data.py and found that the ACE2005 dataset has to be preprocessed first. When I attempted the preprocessing, I noticed that the code tries to read doc.txt files from the split folder. The original data doesn't contain a split folder, and I could not find any step in the preprocessing code that generates the files under it. Could you please clarify whether this part of the code is missing, or whether I haven't followed the process correctly? I would greatly appreciate a response. Thank you very much!


osainz59 · 2024-10-27T20:43:22Z

Hi @zdhgreat ,

Sorry about that. You can find the split folder in the code released with the OneIE paper (from which we obtained the preprocessing script): http://blender.cs.illinois.edu/software/oneie/ .

Thank you for pointing it out, I will add this to the README.
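A quick sanity check before running the preprocessing could look like the sketch below. It only confirms that the split lists are in place after copying the folder over. The function name `find_split_lists` and the `*doc.txt` file pattern are assumptions based on the description in this thread, not the repository's actual layout.

```python
from pathlib import Path


def find_split_lists(split_dir):
    """Return the sorted paths of all files ending in 'doc.txt' under split_dir.

    Hypothetical helper: the '*doc.txt' pattern is an assumption about how
    the split lists are named.
    """
    root = Path(split_dir)
    if not root.is_dir():
        raise FileNotFoundError(f"split folder not found: {root}")
    # Search recursively, since the lists may sit in a sub-folder.
    return sorted(p for p in root.rglob("*doc.txt") if p.is_file())
```

An empty result would suggest the folder was copied to the wrong place or the files follow a different naming scheme.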

zdhgreat · 2024-11-12T03:33:57Z

Thank you very much for your help. Your suggestion resolved my issue with the split in the ACE2005 dataset. However, I still have some questions and would greatly appreciate further clarification.

The link you provided for the E3C dataset seems to be no longer valid. I found the E3C dataset on GitHub in the form of test.txt and train.txt files, but there is no dev file, and the format does not correspond to the tsv format mentioned in the config file. There is also no processing file for the conversion.

The CASIE processing files have no split functionality. Can I perform the split myself?

The DIANN data consists of txt files that need processing, and I am not sure how to handle them. As with E3C, they also need to be converted into tsv files.

Once again, I would be very grateful if you could address these questions. Thank you very much!
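For a dataset that ships without a dev file or without any split, one option is to hold out part of the training data with a fixed seed. The following is a minimal sketch, assuming the examples have already been loaded into a list (one item per document or sentence). The function name `train_dev_split`, the 10% fraction, and the seed are illustrative choices, not taken from the repository, so scores on a self-made split may not be directly comparable to published ones.

```python
import random


def train_dev_split(examples, dev_fraction=0.1, seed=0):
    """Hold out a reproducible dev set from a list of examples.

    Hypothetical helper: the fraction and seed are arbitrary defaults.
    The original order of the examples is preserved in both outputs.
    """
    if not 0.0 < dev_fraction < 1.0:
        raise ValueError("dev_fraction must be strictly between 0 and 1")
    indices = list(range(len(examples)))
    # A private Random instance keeps the split independent of global state.
    random.Random(seed).shuffle(indices)
    n_dev = max(1, round(len(examples) * dev_fraction)) if examples else 0
    dev_idx = set(indices[:n_dev])
    train = [ex for i, ex in enumerate(examples) if i not in dev_idx]
    dev = [ex for i, ex in enumerate(examples) if i in dev_idx]
    return train, dev
```

Calling it twice with the same seed returns the same split, which makes the held-out set easy to regenerate later.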
