Comments (3)
Hi :) thank you for your interest.
The jsons are retrieved using the semanticscholar python package in a 5-step process described in the blogpost and paper. This process included manual steps (i.e., I looked at papers by myself and curated them), so I can't give you a code snippet that does everything automatically.
Step 1 and step 4 were the automatic steps, and are very simple to do in a few lines of code. The process for the retrieval in these steps is as follows:
Step 1: Search semanticscholar for every keyword, then verify if the title or abstract contains at least two keywords.
Step 4: Go over your list of papers, and for every paper they cite or reference, retrieve its json file and verify if the title or abstract contains at least two keywords.
I uploaded a notebook that contains the code for both steps in commit ebb3c59 . The file is Retrieval.ipynb
. Hope that helps!
from xai-scholar.
@alonjacovi thanks for the response, I was actually going through the Semantic scholar Api and now checked the code you gave.
There is an argument in the API as "year", you didn't use it in your retrieval and you got till 2022 and 14 papers from 2023.
Even if we don't use it we get all papers till date?
from xai-scholar.
@Dipankar1997161 I executed my retrieval in December 2022. Some papers have the year 2023 because SemanticScholar recorded their year of publication before they were "officially" published. If you retrieve papers from the API, you should get all papers that match your query regardless of year. But this is related to the SemanticScholar API that I have no control over, so any further details you may have to ask them :)
Closing this issue for now as it seems the question has been answered, feel free to reopen if necessary
from xai-scholar.
Related Issues (1)
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from xai-scholar.