jhljx / gkt Goto Github PK
View Code? Open in Web Editor NEWGraph-based Knowledge Tracing: Modeling Student Proficiency Using Graph Neural Network
License: MIT License
Graph-based Knowledge Tracing: Modeling Student Proficiency Using Graph Neural Network
License: MIT License
Yes I really want to run this program on large-scale datasets, but I really don't know how to optimize the code on the GPU. Please give me a guide.
I've tried to set "--epochs=5 --batch-size=32" when training the model, but still got the following:
RuntimeError: CUDA out of memory. Tried to allocate 2.00 MiB (GPU 0; 10.76 GiB total capacity; 6.91 GiB already allocated; 5.81 MiB free; 6.93 GiB reserved in total by PyTorch)
I'm just a beginner. Appreciate your sincere help.
import scipy.sparse as sp
ModuleNotFoundError: No module named 'scipy'
i already installed scipy1.5.2, why show error like that?
Hi, jhljx. What a brief and beautiful implement ! But , when I run the code on my mechine, the predict values of the model were nan afer few epochs . And I found that the parameters of the model were updated by nan through back propagation. I am not sure if this is caused by gradient explosion. If it is , how to solve the problem. I have tried to decrease the learning rate and batch size , but it seems not work.
Hi, this code is beautiful, but it runs so slow on my Nvidia GTX 2080 Ti, taking 248 seconds for one batch (on assist2009, batch size = 128). And once I use dataset with larger num of skills, the program corrupted due to lack of GPU memory. So I wonder what kind of GPU do you use to run this model and how long does it take to train?
First of all, I would like to thank you for implementing the codebase in such an efficient and comprehensive way!
Just one minor issue that I have noticed that in processing.py
as you are converting raw data from CSV files to answers, questions, and features, you mention in comments that in Step 4, answers need to be shifted by 1, which I think is in line with the problem definition and the paper. However, I don't think there are any shift operations been done in preparing the raw data, nor is there any operation that does this when iterating the dataloader.
So I would like to know whether there is anything that I overlooked, or this is indeed a typo in the codebase.
Thank you!
Below is the code snippet for Step 4 from the file.
# Step 4 - Convert to a sequence per user id and shift features 1 timestep
feature_list = []
question_list = []
answer_list = []
seq_len_list = []
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.