peS2o #26

Open
nkandpa2 opened this issue Nov 16, 2023 · 2 comments
Labels
external project We will be including this data but the work will be done primarily by someone else.

Comments

@nkandpa2
Collaborator

Data: https://github.com/allenai/peS2o

@soldni
Collaborator

soldni commented Nov 19, 2023

peS2o is mostly done as is. To consider:

  • I will do a refresh with more up-to-date content as we near release deadline.
  • Paragraphs are currently joined using \n\n. As @StellaAthena was mentioning, we wanna keep both raw and formatted options.
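One way to keep both options is to store the paragraph list next to the joined text. This is only a minimal sketch: the field names "paragraphs" and "text" and both helper functions are hypothetical, not the actual peS2o schema or tooling.

```python
def make_record(paragraphs: list[str]) -> dict:
    """Build a record holding both the raw paragraphs and the joined text.

    The keys "paragraphs" and "text" are illustrative assumptions.
    """
    return {
        "paragraphs": list(paragraphs),      # raw option: one entry per paragraph
        "text": "\n\n".join(paragraphs),     # formatted option: joined with a blank line
    }


def split_paragraphs(text: str) -> list[str]:
    """Recover paragraphs from text joined with "\\n\\n", skipping empty pieces."""
    return [p for p in text.split("\n\n") if p.strip()]


record = make_record(["Intro.", "Methods."])
print(record["text"])
print(split_paragraphs(record["text"]))
```

Splitting the joined text only recovers the original list when no paragraph itself contains a blank line, which is one reason to keep the raw list as well.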

@StellaAthena StellaAthena added the external project label Jan 8, 2024
@craffel craffel changed the title from Pes2o to peS2o Feb 5, 2024
@craffel
Collaborator

craffel commented Feb 5, 2024

Note that peS2o may have content overlap with arxiv #4 and biorxiv/chemrxiv #28 sources - we will need to dedupe, @soldni suggests based on title.
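A rough sketch of what title-based dedup could look like, assuming each document is a dict with a "title" key. The normalization rules and both function names are hypothetical; nothing here is the project's actual pipeline.

```python
import re
import unicodedata


def normalize_title(title: str) -> str:
    """Casefold, replace punctuation with spaces, and collapse whitespace."""
    text = unicodedata.normalize("NFKC", title).casefold()
    text = re.sub(r"[^\w\s]", " ", text)
    return " ".join(text.split())


def drop_title_overlap(docs, seen_titles):
    """Keep docs whose normalized title is not already seen.

    `seen_titles` stands in for titles from the other sources (e.g. arxiv);
    duplicates inside `docs` itself are dropped as well.
    """
    seen = {normalize_title(t) for t in seen_titles}
    kept = []
    for doc in docs:
        key = normalize_title(doc["title"])
        if key and key in seen:
            continue  # title already covered by another source or an earlier doc
        seen.add(key)
        kept.append(doc)
    return kept


docs = [
    {"title": "Attention Is All You Need!"},
    {"title": "A New Paper"},
    {"title": "a new  paper."},
]
print(drop_title_overlap(docs, ["attention is all you need"]))
```

Exact matching on normalized titles is cheap but will miss retitled versions of a paper and can merge distinct papers that share a generic title, so it may need to be combined with other signals.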

4 participants