Skip to content

Update /en/lessons/analyzing-multilingual-text-nltk-spacy-stanza#3760

Merged
anisa-hawes merged 7 commits intogh-pagesfrom
update-analyzing-multilingual-text-nltk-spacy-stanza
May 8, 2026
Merged

Update /en/lessons/analyzing-multilingual-text-nltk-spacy-stanza#3760
anisa-hawes merged 7 commits intogh-pagesfrom
update-analyzing-multilingual-text-nltk-spacy-stanza

Conversation

@anisa-hawes
Copy link
Copy Markdown
Contributor

@anisa-hawes anisa-hawes commented Apr 28, 2026

I am preparing some updates to the EN lesson analyzing-multilingual-text-nltk-spacy-stanza.

In the process of translation into French, and in response to recent feedback from a reader, Émilien Schultz has prepared a sequence of updates to the lesson + the accompanying Python notebook, and has re-tested the lesson.

Closes #3713

Checklist

  • Assign yourself in the "Assignees" menu
  • Add the appropriate "Label"
  • If this PR closes an Issue, add the phrase Closes #ISSUENUMBER to your summary above
  • Ensure the status checks pass: if you have difficulty fixing build errors, please contact our Publishing Manager @anisa-hawes
  • Check the Netlify Preview: navigate to netlify/ph-preview/deploy-preview and click 'details' (at right)
  • Assign at least one individual or team to "Reviewers"
    • [ ] if the text needs to be translated, please follow the translation request guidelines, then assign the relevant language team(s) as "Reviewers" and tag both the team as well as the managing editor in your PR.

@anisa-hawes
Copy link
Copy Markdown
Contributor Author

Hello Alex @hawc2 and Laura @lachapot,

As you know, a reader from our community recently contacted us about an issue they encountered with language_detector and spaCy 3.8.11 (#3713) when they were teaching using this lesson.

Émilien @emilienschultz is currently editing a French translation of analyzing-multilingual-text-nltk-spacy-stanza, and the same issue was noted in the process of translation and review.

In discussion about the translation, Émilien and I agreed that it is worth updating + re-testing the code now so it works with the current versions of Python and spaCy before publication of the translation. While adding annotations and warnings can be helpful, the most sustainable option is to adapt the lesson so it runs on the latest versions at the time of release.

As that work was being done, I have invited Émilien to prepare the same updates to the English original.

  • In this branch, Émilien has applied the updates EN .md + .ipynb
  • I've added a summary note at the beginning of the lesson to outline the updates
  • I've added lesson-tester and tested-date to the YAML metadata to document Émilien's contribution.

If you're happy with these revisions, we can merge in the updates to the original lesson.

Please note that the .ipynb updates can be reviewed in the files changed tab, but the nbviewer rendered version of the notebook won't display the changes until this PR is merged in.

@anisa-hawes anisa-hawes requested review from hawc2 and lachapot May 7, 2026 18:04
Copy link
Copy Markdown

@lachapot lachapot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you, Anisa. I’ve reviewed the updates and everything looks good to me too. I’m very happy to hear the lesson is being updated to the current spaCy version and is being translated and used in classrooms. Thanks to you and Émilien for all your work on this!

@anisa-hawes
Copy link
Copy Markdown
Contributor Author

Thank you, @lachapot + @hawc2.


Enormous thanks to Émilien @emilienschultz for the time invested into preparing and testing these updates.

@anisa-hawes anisa-hawes merged commit a1da6eb into gh-pages May 8, 2026
6 checks passed
@anisa-hawes anisa-hawes deleted the update-analyzing-multilingual-text-nltk-spacy-stanza branch May 8, 2026 11:03
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Analyzing Multilingual French and Russian Text: Error message when using Language.factory("language_detector")

4 participants