To evaluate the trained graph and its embeddings, a node classification task is run. First, the Cora data is imported into a networkX graph, which is saved to a pickle file so that it can be used to train the embeddings with our EP-SP algorithm. Afterwards, the trained embeddings are evaluated with the LibLinear L2-regularized logistic regression provided by sklearn.
Graph building is implemented in cora.py, the evaluation in node_classification.py.
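
The graph-building step described above (create a networkX graph, then pickle it) can be sketched roughly as follows. This is a minimal illustration, not the code in cora.py: the helper names `build_graph`, `save_graph` and `load_graph`, the `label` node attribute, and the choice of an undirected graph are assumptions made for the example.

```python
import pickle

import networkx as nx


def build_graph(nodes, edges):
    """Build a graph from (node_id, label) pairs and (source, target) edges.

    Assumption: an undirected graph with one 'label' attribute per node.
    """
    graph = nx.Graph()
    for node_id, label in nodes:
        graph.add_node(node_id, label=label)
    graph.add_edges_from(edges)
    return graph


def save_graph(graph, path):
    """Serialize the graph to a pickle file."""
    with open(path, "wb") as f:
        pickle.dump(graph, f)


def load_graph(path):
    """Load a graph that was written by save_graph."""
    with open(path, "rb") as f:
        return pickle.load(f)
```

Because pickle stores Python objects directly, node attributes survive the round trip without being converted to another format first.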
# Required Data
- Cora dataset saved in cora_data for building the graph
...
...
@@ -33,7 +34,10 @@ For node_classification.py
# Running instructions
For cora.py
...
python3 cora.py [-n] [-e] [-o]
-n / --nodes Path to cora file containing nodes
-e / --edges Path to cora file containing edges
-o / --output Path where the graph should be saved
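
The options listed above could be parsed with `argparse` along these lines. This is a sketch under assumptions: the function name `parse_args` and the description string are hypothetical, and the actual argument handling in cora.py (including any default paths) may differ.

```python
import argparse


def parse_args(argv=None):
    """Parse the three options listed in the running instructions."""
    parser = argparse.ArgumentParser(
        description="Build a networkX graph from Cora."  # hypothetical description
    )
    parser.add_argument("-n", "--nodes", help="Path to cora file containing nodes")
    parser.add_argument("-e", "--edges", help="Path to cora file containing edges")
    parser.add_argument("-o", "--output", help="Path where the graph should be saved")
    # No defaults are assumed here; omitted options are returned as None.
    return parser.parse_args(argv)
```

Passing a list such as `["-n", "nodes_file", "-e", "edges_file"]` to `parse_args` makes the function easy to exercise without touching the real command line.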
@project: Software Projekt @ Heidelberg University, Institute for Computational Linguistics
@requirements: cora data, numpy, networkX, pickle
@info
Getting a networkx graph from Cora. Graph can be saved in txt file. CARE: numpy-arrays are converted to lists due to errors (NumPy array is not JSON serializable).
Getting a networkx graph from Cora. Graph will be saved in a pickle file.