Allowing other characters in a token #86

carpie · 2019-05-09T21:27:46Z

When using UUIDs in tokens, the tokens are rejected because of the - character in them. I can subclass BooleanAlgebra and override tokenize but it is a lot of duplication for allowing an additional character in the token. It would be nice if one could specify the allowable character set.

The text was updated successfully, but these errors were encountered:

pombredanne · 2019-10-04T07:11:08Z

Thanks for this and sorry for the late reply and review. It kinda makes sense... the rationale for only allowing certain characters is that tokens could then be used as Python-level identifiers and to avid possibly collision with short-form operators (~+| ... etc). In practice this is not big requirement IMHO. In fact in https://github.com/nexB/license-expression/ we accept any characters in tokens and have implemented a few custom tokenizers too.

carpie mentioned this issue May 9, 2019

Allow a custom set of characters in a token #87

Merged

pombredanne added the enhancement label Oct 4, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allowing other characters in a token #86

Allowing other characters in a token #86

carpie commented May 9, 2019

pombredanne commented Oct 4, 2019

Allowing other characters in a token #86

Allowing other characters in a token #86

Comments

carpie commented May 9, 2019

pombredanne commented Oct 4, 2019