-
Notifications
You must be signed in to change notification settings - Fork 30
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
chore: bump version to 0.15.0 #355
Conversation
6f6310d
to
dbbfd69
Compare
Coverage Report
Files without new missing coverage
273 files skipped due to complete coverage. Coverage success: total of 98.06% is above 98.06% 🎉 |
Quality Gate passedIssues Measures |
Changelog
Added
edsnlp.data.read_parquet
now accept awork_unit="fragment"
option to split tasks between workers by parquet fragment instead of row. When this is enabled, workers do not read every fragment while skipping 1 in n rows, but read all rows of 1/n fragments, which should be faster.edsnlp.train
scriptoutput_model_dir
), and whether to save the model or not (save_model
)logger=False
)edsnlp.data.read_conll
and with a specificeds.conll_dict2doc
convertereds.biaffine_dep_parser
) component and metricseds.extractive_qa
component to perform extractive question answering using questions as prompts to tag entities instead of a list of predefined labels as ineds.ner_crf
.Fixed
join_thread
missing attribute inSimpleQueue
when cleaning a multiprocessing executorcls_token_id
andsep_token_id
(we now also look for these tokens in thespecial_tokens_map
andvocab
mappings)random.RandomState()
) when shuffling in data readers : this is important forsub_batch_size
argument ofTrainingData
.