Bug Tracker RSS feed

ID 309🔗
Submitted by: Nicola Talbot 🦜
Date: 2025-08-11 12:55:01
Status Open
Sign in if you want to bump this report.
Category datatool
Version 3.4.1
Summary \datatool_get_first_grapheme:nN has stopped working

Return to Search Results

Sign in to subscribe to notifications about this report.

Description

\datatool_get_first_grapheme:nN internally uses \text_map_inline:nN (which should map over each grapheme in the first argument) to fetch the first grapheme (by breaking the loop at the end of the first iteration).

Unfortunately a recent update to the LaTeX kernel has changed the behaviour of \text_map_inline:nN so that it iterates over the tokens rather than the graphemes. This means that with pdfLaTeX (as opposed to LuaLaTeX and XeLaTeX) is now picking up the first octet of a UTF-8 character rather than all the octets that make up the character.

The minimal example produces an "Invalid UTF-8 byte sequence" error. This issue is outside of my control but I have reported it ([Link]).

Note that since \datatool_get_first_grapheme:nN is internally used by \printnoidxglossary provided by the glossaries package, that command also affected.

MWE

Download (218B)

% arara: pdflatex
\documentclass{article}

\usepackage{datatool-base}

\begin{document}
\ExplSyntaxOn
\datatool_get_first_grapheme:nN { élan } \l_tmpa_tl
\tl_show:N \l_tmpa_tl
\l_tmpa_tl
\ExplSyntaxOff
\end{document}

Evaluation

Comments

0 comments.


Add Comment

Name (optional):

Are you human? Please confirm the bug report ID (which can be found at the top of this page) or login if you have an account. All guest comments have to be manually checked before they appear on the page. There are too many bots trying to spam the site to allow unauthenticated users to post without verification.

Comment:


You can use the following markup:

Block:

[pre]Displayed verbatim[/pre]
[quote]block quote[/quote]

In line:

[tt]code[/tt]
[file]file/package/class name[/file]
[em]emphasized text[/em]
[b]bold text[/b]
[url]web address[/url] [sup]superscript[/sup]
[sub]subscript[/sub]

Ordered list:
[ol]
[li]first item[/li]
[li]second item[/li]
[/ol]

Unordered list:
[ul]
[li]first item[/li]
[li]second item[/li]
[/ul]

You can use the Preview button to review your message formatting before submitting.

Page permalink: https://www.dickimaw-books.com/bugtracker.php?key=309

Return to Search Results