[automated] Update metadata from Papers with Code #5415

Closed
acl-pwc-bot wants to merge 1 commit from automated/update-pwc-metadata

acl-pwc-bot
Collaborator

Auto-generated PR by GitHub action; will be merged automatically.

Update 4659 lines of XML data from Papers with Code.

@acl-pwc-bot force-pushed the automated/update-pwc-metadata branch from e2a3cab to 3d74e64 on June 26, 2025 00:49
github-actions bot previously approved these changes Jun 26, 2025
Member

@mbollmann left a comment

@mjpost Something fishy is going on here

@@ -72,7 +72,7 @@
<abstract>This paper describes the ESPnet submissions to the How2 Speech Translation task at IWSLT2019. In this year, we mainly build our systems based on Transformer architectures in all tasks and focus on the end-to-end speech translation (E2E-ST). We first compare RNN-based models and Transformer, and then confirm Transformer models significantly and consistently outperform RNN models in all tasks and corpora. Next, we investigate pre-training of E2E-ST models with the ASR and MT tasks. On top of the pre-training, we further explore knowledge distillation from the NMT model and the deeper speech encoder, and confirm drastic improvements over the baseline model. All of our codes are publicly available in ESPnet.</abstract>
<url hash="be527f4e">2019.iwslt-1.4</url>
<bibkey>inaguma-etal-2019-espnet</bibkey>
<pwcdataset url="https://paperswithcode.com/dataset/librispeech">LibriSpeech</pwcdataset>
<pwcdataset url="https://paperswithcode.com/dataset/librispeech">Abortion pills in kuwait +966505195917 cytotec pills *buy* kuwait</pwcdataset>

What is going on here? The sheer number of changed lines is suspicious, and there are things like this in here

@@ -239,7 +238,7 @@
<bibkey>singh-etal-2020-bertnesia</bibkey>
<pwccode url="https://github.com/jwallat/knowledge-probing" additional="false">jwallat/knowledge-probing</pwccode>
<pwcdataset url="https://paperswithcode.com/dataset/lama">LAMA</pwcdataset>
<pwcdataset url="https://paperswithcode.com/dataset/ms-marco">MS MARCO</pwcdataset>
<pwcdataset url="https://paperswithcode.com/dataset/ms-marco">Mtp-Kit (500MG) Prices » Satwa[(+971552965071**)] Abortion Pills In Ajman, Kuwait UAE</pwcdataset>

Another one

@@ -1741,7 +1738,7 @@
<url hash="1d6ae405">2020.coling-main.130</url>
<doi>10.18653/v1/2020.coling-main.130</doi>
<bibkey>bai-etal-2020-pre</bibkey>
<pwcdataset url="https://paperswithcode.com/dataset/multinli">MultiNLI</pwcdataset>
<pwcdataset url="https://paperswithcode.com/dataset/multinli">OBAT PENGGUGUR KANDUNGAN DI BANJARBARU (087776558899)</pwcdataset>

This line seems to occur dozens (hundreds?) of times
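
One way to check how often such entries occur is sketched below. It is a minimal illustration, not part of the actual ingestion script: the function names, the `collection`/`paper` wrapper elements in the sample, and the phone-number heuristic are all assumptions. Only the `pwcdataset` element and its `url` attribute come from the diff above.

```python
import re
import xml.etree.ElementTree as ET
from collections import Counter

# Assumption: a run of 7+ digits (optionally separated by spaces, dashes, or
# parentheses) inside a dataset name looks like a phone number.
PHONE_LIKE = re.compile(r"(?:\d[\s\-()]*){7,}")


def suspicious_dataset_names(xml_text):
    """Count <pwcdataset> names that contain a phone-like digit run."""
    root = ET.fromstring(xml_text)
    counts = Counter()
    for elem in root.iter("pwcdataset"):
        name = (elem.text or "").strip()
        if PHONE_LIKE.search(name):
            counts[name] += 1
    return counts


def urls_with_conflicting_names(xml_text):
    """Return dataset URLs that appear with more than one distinct name."""
    root = ET.fromstring(xml_text)
    names_by_url = {}
    for elem in root.iter("pwcdataset"):
        name = (elem.text or "").strip()
        names_by_url.setdefault(elem.get("url"), set()).add(name)
    return {url: names for url, names in names_by_url.items() if len(names) > 1}


# Hypothetical sample; the wrapper elements and the spam text are placeholders.
SAMPLE = """<collection>
  <paper>
    <pwcdataset url="https://paperswithcode.com/dataset/multinli">MultiNLI</pwcdataset>
  </paper>
  <paper>
    <pwcdataset url="https://paperswithcode.com/dataset/multinli">SPAM LISTING (0123456789)</pwcdataset>
  </paper>
</collection>"""

if __name__ == "__main__":
    print(suspicious_dataset_names(SAMPLE))
    print(urls_with_conflicting_names(SAMPLE))
```

The second check relies on the pattern visible in the hunks above, where a known dataset URL is paired with an unrelated name; either heuristic could produce false positives and would need tuning against real data.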

@mbollmann
Member

Screenshot_2025-06-26-10-58-54-87_3aea4af51f236e4932235fdada7d1643.jpg

This actually originates from their website. I emailed [email protected]; I don't remember who we can tag here.

@mjpost
Member

mjpost commented Jun 26, 2025

Our original contact was @rstojnic; I wonder if he is still involved.

@mbollmann closed this on Jul 1, 2025
3 participants