Jan Philip Wahle's Projects
The official implementation of the paper "Incorporating Word Sense Disambiguation into Neural Language Models".
Data and software for building the ACL Anthology.
The official implementation of the ACL 2023 paper "The Elephant in the Room: Analyzing the Presence of Big Tech in Natural Language Processing Research"
This is the official implementation of the paper "Citation Amnesia: NLP and Other Academic Fields Are in a Citation Age Recession"
The main controller for services in the cs-insights project through docker-compose.
API server of the cs-insights project. This is the main part of storing data and accessing an external data analysis endpoint. It uses a mongoDB instance to store everything and queries the cs-insights-prediction-endpoint to get machine learning results.
This repository implements the interaction with DBLP, information extraction and pre-processing of papers, and a client to store data to the cs-insights-backend.
React frontend of the cs-insights project. This is the main part of visualizing data. It uses the cs-insights-backend and cs-insights-prediction-endpoint.
Python prediction backend of the cs-insights project which does the heavy lifting for analyzing topics and other semantic analysis features using parents and childrens of docker containers that can run on different servers
Uptime tracker for endpoints of the cs-insights project.
The main frontend and backend of the cs-insights project.
The official implementation of the EMNLP 2022 paper "How Large Language Models are Transforming Machine-Paraphrased Plagiarism".
The official implementation of the EMNLP 2023 paper "We are Who We Cite: Bridges of Influence Between Natural Language Processing and Other Academic Fields"
The official implementation of the EMNLP 2023 paper "Paraphrase Types for Generation and Detection"
The official implementation of the iConference 2022 paper "Testing the Generalization of Neural Language Models for COVID-19 Misinformation Detection"
The official implementation of the iConference 2022 paper "Identifying Machine-Paraphrased Plagiarism".
My GitHub Profile Page.
The official repository for the LREC 2022 paper "D3: A Massive Dataset of Scholarly Metadata for Analyzing the State of Computer Science Research"