Skip to contents

LimpiaR 1.1.0

  • Breaking Change - refactoring limpiar_emojis into limpiar_recode_emojis and limpiar_remove_emojis - limpiar_emojis no longer exists so code using it would need to be updated.
  • Introduced limpiar_non_ascii and limpiar_alphanumeric to make removing special characters quicker and easier
  • Introduced limpiar_wrap_string for reading outputs in {DT}’s Data Tables and {plotly} hovers.
  • Introduced a vignette - near_duplicates.Rmd for using the limpiar_spam_grams function to remove near duplicates.
  • Updated the limpiar_intro.Rmd vignette.
  • behind doors improvements to documentation, error handling, and tests

LimpiaR 1.0.0

Parts of Speech Workflow

  • Import & download models with limpiar_pos_import
    • Multilingual
    • Caching support
  • Annotate models with limpiar_pos_annotate
  • Removal of the limpiar_df function (be mindful about individual cleaning steps!)