How to find individual words and their count? #1908

For this very specific situation, this is what I would do:

$ cat corpus1
alpha alpha1 alpha2 1alpha 2alpha
alpha alpha1 alpha2 1alpha 2alpha
alpha alpha1 alpha2 1alpha 2alpha
alpha alpha1 alpha2 1alpha 2alpha
alpha alpha1 alpha2 1alpha 2alpha
alpha alpha1 alpha2 1alpha 2alpha

$ cat corpus2
alpha alpha1 alpha2 1alpha 2alpha
alpha alpha1 alpha2 1alpha 2alpha
alpha alpha1 alpha2 1alpha 2alpha
alpha alpha1 alpha2 1alpha 2alpha
alpha alpha1 alpha2 1alpha 2alpha
alpha alpha1 alpha2 1alpha 2alpha

$ for w in alpha alpha1 alpha2 1alpha 2alpha; do count="$(rg -w "$w" -o | wc -l)"; printf '%s\t%s\n' "$w" "$count"; done
alpha   12
alpha1  12
alpha2  12
1alpha  12
2alpha  12
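
If the word list is not known in advance, one possible variation on the loop above is to let the search tool emit every word and count the duplicates afterwards. The sketch below is only an illustration under a few assumptions: `count_words` is a hypothetical helper name, a "word" is taken to be a run of `\w` characters, and the `grep` branch is a fallback for machines without `rg` (it only handles ASCII word characters).

```shell
# Hypothetical helper (not part of ripgrep): print "word<TAB>count" for
# every distinct word found in the files under a directory.
count_words() {
  dir="${1:-.}"
  if command -v rg >/dev/null 2>&1; then
    # -o: print only the matched text, one match per line
    # -I: omit file names, -N: omit line numbers
    rg -o -I -N '\w+' "$dir"
  else
    # Assumed fallback when rg is not installed (ASCII word characters only)
    grep -rohE '[[:alnum:]_]+' "$dir"
  fi | sort | uniq -c | awk '{ printf "%s\t%s\n", $2, $1 }'
}
```

Calling `count_words .` in the directory holding `corpus1` and `corpus2` should list each of the five words once, each with a count of 12. Unlike the per-word loop, this reads the files only once, at the cost of fixing one definition of "word" for the whole run.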

I suspect what you're after is a way …

Answer selected by BurntSushi