A sample dataset containing text embeddings generated using Hugging Face's
embedding API. This dataset demonstrates the structure of results returned
by the hf_embed_batch()
and hf_embed_df()
functions.
Format
A data frame with 3 rows and 773 variables:
- id
Integer; unique identifier for each text
- text
Character; the original text that was embedded
- category
Character; category classification of the text
- .error
Logical; whether the embedding process failed
- .error_message
Character; error message if embedding failed (NA if successful)
- V1, V2, ..., V768
Numeric; embedding vector dimensions