Skip to content

Commit a45d3e3

Browse files
Update 07-wordEmbed_intro.md
1 parent 9748c56 commit a45d3e3

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

_episodes/07-wordEmbed_intro.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -128,7 +128,7 @@ The learned vectors create useful and meaningful representations of words, captu
128128

129129
## Preliminary Considerations
130130
In determining whether or not Word2Vec is a suitable embedding method for your research, it's important to consider the following:
131-
* **Analysis Relevance**: Does examining the relationships and meanings among words serve as a guideline for your research? Are you able to pinpoint specific terms or clusters of terms that encapsulate the broader conceptual realms you are investigating?"
131+
* **Analysis Relevance**: Does examining the relationships and meanings among words serve as a guideline for your research? Are you able to pinpoint specific terms or clusters of terms that encapsulate the broader conceptual realms you are investigating?
132132
* **Data Quality**: Ensure that your text corpus is of high quality. Garbage or noisy data can adversely affect Word2Vec embeddings.
133133
* **Corpus Size**: Word2Vec performs better with larger corpora. Having substantial text data improves the quality of learned word vectors.
134134
* **Domain-Specific Data Availability**: Choose a dataset relevant to your DH research. If you’re analyzing historical texts, use historical documents. For sentiment analysis, domain-specific data matters.

0 commit comments

Comments
 (0)