You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I was trying to diagnose why BioBert is underperforming a standard Bert model for a PubMed task. I took a look at the vocab.txt, and noticed that the words are not what one would expect from PubMed word frequencies.
grep flix vocab.txt returns "Netflix" (46 PubMed occurrences) but not "infliximab" (>13000 PubMed occurrences)
Why is this?
The text was updated successfully, but these errors were encountered:
I was trying to diagnose why BioBert is underperforming a standard Bert model for a PubMed task. I took a look at the vocab.txt, and noticed that the words are not what one would expect from PubMed word frequencies.
grep flix vocab.txt
returns "Netflix" (46 PubMed occurrences) but not "infliximab" (>13000 PubMed occurrences)Why is this?
The text was updated successfully, but these errors were encountered: