When parsing PDF files through pdfparser, words containing "ff" are not saved correctly in the word table.
For example, "puff" is saved as "pu".
This only happens with words in the content, not in the title. In other words, if "stuff" is a title word, "stuff" is saved; if "stuff" is in the content, only "stu" is saved.
Upon further testing, it appears that consecutive "f" characters are treated as a space anywhere in a word, while a single "f" is not affected.
The word "boardfflamps" is parsed and saved as two words, "board" and "lamps".
The word "Cablefknoss" is parsed and saved correctly as "cablefknoss".
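
One possible explanation, not confirmed anywhere in this report, is that the PDF stores "ff" as the single ligature character U+FB00 and the word splitter treats that character as a separator. The sketch below reproduces the symptoms under that assumption. `tokenize` and `normalize_ligatures` are hypothetical helpers written for illustration; they are not pdfparser's code.

```python
import re
import unicodedata


def tokenize(text):
    # Hypothetical word splitter: keeps only runs of ASCII letters, lowercased.
    # Any other character, including a ligature glyph, ends the current word.
    return [w.lower() for w in re.findall(r"[A-Za-z]+", text)]


def normalize_ligatures(text):
    # NFKC normalization folds compatibility characters such as U+FB00
    # (LATIN SMALL LIGATURE FF) into their plain-letter equivalents ("ff").
    return unicodedata.normalize("NFKC", text)


# Assumed extraction output: "ff" arrives as U+FB00, a single "f" stays ASCII.
extracted = "board\ufb00lamps pu\ufb00 Cablefknoss"

print(tokenize(extracted))
# ['board', 'lamps', 'pu', 'cablefknoss']

print(tokenize(normalize_ligatures(extracted)))
# ['boardfflamps', 'puff', 'cablefknoss']
```

If this is the cause, normalizing the extracted text before it is split into words would keep "puff" and "boardfflamps" intact. It would also be consistent with "Cablefknoss" being unaffected, since a single "f" is not a ligature.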