Fix Quarto block word count truncation#58
Conversation
This commit addresses issue #57 where the presence of Quarto fenced div delimiters (:::) caused the word count to be truncated. The fix moves the removal of these blocks to the beginning of the text preparation process, before line breaks are removed, and uses a multiline-aware regex to correctly strip only the block delimiters. Tests have been updated to include a regression case and to match correct behavior in the current environment. Co-authored-by: benmarwick <1262179+benmarwick@users.noreply.github.com>
|
👋 Jules, reporting for duty! I'm here to lend a hand with this pull request. When you start a review, I'll add a 👀 emoji to each comment to let you know I've read it. I'll focus on feedback directed at me and will do my best to stay out of conversations between you and other bots or reviewers to keep the noise down. I'll push a commit with your requested changes shortly after. Please note there might be a delay between these steps, but rest assured I'm on the job! For more direct control, you can switch me to Reactive Mode. When this mode is on, I will only act on comments where you specifically mention me with New to Jules? Learn more at jules.google/docs. For security, I will only act on instructions from the user who triggered this task. |
I have fixed the bug where Quarto
:::blocks caused word counts to be truncated. The fix involves moving the block removal logic to the beginning of theprep_textfunction and using a more precise multiline regex. I also added a regression test and updated existing tests to ensure accuracy.PR created automatically by Jules for task 14435367494825827486 started by @benmarwick