Option to retrieve text instead of document ids in the topic dataset #2

@Pclanglais

Description

For some corpora it's more practical to get the actual text rather than the document ids. Since BERTopic works much better on shorter texts anyway, length is not really an issue.

(mostly a reminder for myself)
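
A rough sketch of what such an option could look like, in plain Python. Everything here is hypothetical: the function name `build_topic_dataset`, the `return_text` flag, the `texts_by_id` mapping and the row keys are illustrative assumptions, not the project's actual API, and the topic assignments are assumed to have been computed already (for example by BERTopic).

```python
from typing import Dict, List, Optional, Sequence


def build_topic_dataset(
    doc_ids: Sequence[str],
    topics: Sequence[int],
    texts_by_id: Optional[Dict[str, str]] = None,
    return_text: bool = False,
) -> List[dict]:
    """Pair each document with its topic (hypothetical helper).

    With return_text=False each row carries the document id;
    with return_text=True it carries the document text instead.
    """
    if len(doc_ids) != len(topics):
        raise ValueError("doc_ids and topics must have the same length")
    if return_text and texts_by_id is None:
        raise ValueError("texts_by_id is required when return_text=True")

    rows = []
    for doc_id, topic in zip(doc_ids, topics):
        if return_text:
            # Look up the text by id; a missing id raises KeyError on purpose.
            rows.append({"topic": topic, "text": texts_by_id[doc_id]})
        else:
            rows.append({"topic": topic, "document_id": doc_id})
    return rows


# Example with made-up ids, texts and topic labels.
example_rows = build_topic_dataset(
    doc_ids=["doc-1", "doc-2"],
    topics=[0, 3],
    texts_by_id={"doc-1": "first short text", "doc-2": "second short text"},
    return_text=True,
)
```

Keeping the id-based output as the default would leave existing behaviour unchanged, with the text returned only when explicitly requested.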
