reject exponent notation in lexDecimal by aizu-m · Pull Request #63 · apache/xmlbeans

aizu-m · 2026-06-20T09:54:37Z

XsTypeConverter.lexDecimal("1E5") -> 100000

Spotted this comparing the rich parser path against the validating one. xsd:decimal has no exponent, so "1E5" is not a valid decimal, but lexDecimal passes the value straight to new BigDecimal which happily accepts scientific notation. JavaDecimalHolder.validateLexical already rejects a stray 'E', so the two paths disagree on the same value.

Reachable from untrusted XML through the rich parser getBigDecimalValue / getAttributeBigDecimalValue, which call lexDecimal and only expect a NumberFormatException for bad input. An exponent value therefore comes back as a wrong number (100000 above) rather than the documented lexical error.

Reject e/E in lexDecimal so parsing stays inside the decimal lexical space. Plain decimal forms are untouched. Regression test added in XsTypeConverterTest.

pjfanning · 2026-06-20T10:21:12Z

Again, this is too big a risk in a 20 year old lib. Can we at least have a XmlOptions setting?
Default it to disallow e notation because e notation can be expensive to parse if the values have very high exponent values.
You should also check that this is applied to xs:integer parsing and any other lex methods where BigDecimal is used to parse the numbers.

…Exponent

aizu-m · 2026-06-20T13:54:35Z

Pushed. Default is still to disallow the exponent, but it's now behind XmlOptions.setLoadAllowDecimalExponent so it can be turned back on. Wired it the same way as setLoadStrictFloatingPoint: XmlLocale -> Locale -> the decimal holders, with lexDecimal(cs) strict by default and lexDecimal(cs, true) for the lenient parse.

Checked the integer side like you asked: xs:integer parses through new BigInteger, and that already throws on an exponent (new BigInteger("1E5") -> NumberFormatException), so both lexInteger and the integer holder reject it without any change. lexDecimal is the only lex method that builds a BigDecimal; float/double are meant to accept an exponent, so I left those alone.

Tests cover the strict default, the allow-exponent overload, the integer case, a load round-trip with the option set, and the flag itself. misc.checkin and the schematype/validation suites are green.

reject exponent notation in lexDecimal

c98930e

gate decimal exponent rejection behind XmlOptions.setLoadAllowDecimal…

3c59560

…Exponent

asf-gitbox-commits closed this in 5350bfa Jun 21, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

reject exponent notation in lexDecimal#63

reject exponent notation in lexDecimal#63
aizu-m wants to merge 2 commits into
apache:trunkfrom
aizu-m:lexdecimal-reject-exponent

aizu-m commented Jun 20, 2026

Uh oh!

pjfanning commented Jun 20, 2026

Uh oh!

aizu-m commented Jun 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

aizu-m commented Jun 20, 2026

Uh oh!

pjfanning commented Jun 20, 2026

Uh oh!

aizu-m commented Jun 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants