Bug Tracker 
| ID | 309🔗 |
|---|---|
| Submitted by: | Nicola Talbot 🦜 |
| Date: | 2025-08-11 12:55:01 |
| Last update: | 2025-08-13 08:58:39 |
| Status | Closed (Not a Bug) |
| Category | datatool |
| Version | 3.4.1 |
| Summary | \datatool_get_first_grapheme:nN has stopped working |
Sign in to subscribe to notifications about this report.
Description
\datatool_get_first_grapheme:nN internally uses \text_map_inline:nN (which should map over each grapheme in the first argument) to fetch the first grapheme (by breaking the loop at the end of the first iteration). Unfortunately a recent update to the LaTeX kernel has changed the behaviour of \text_map_inline:nN so that it iterates over the tokens rather than the graphemes. This means that with pdfLaTeX (as opposed to LuaLaTeX and XeLaTeX) is now picking up the first octet of a UTF-8 character rather than all the octets that make up the character.
The minimal example produces an "Invalid UTF-8 byte sequence" error. This issue is outside of my control but I have reported it ([Link]).
Note that since \datatool_get_first_grapheme:nN is internally used by \printnoidxglossary provided by the glossaries package, that command also affected.
MWE
Download (218B)
% arara: pdflatex
\documentclass{article}
\usepackage{datatool-base}
\begin{document}
\ExplSyntaxOn
\datatool_get_first_grapheme:nN { élan } \l_tmpa_tl
\tl_show:N \l_tmpa_tl
\l_tmpa_tl
\ExplSyntaxOff
\end{document}
Evaluation
The issue with \text_map_inline:nN has been fixed, but since this is not a bug in datatool I'm closing it as "Not a bug".
Comments
0 comments.
Add Comment
Page permalink: https://www.dickimaw-books.com/bugtracker.php?key=309
