Lyuba Dimitrova, Nadia Arslan, Nicolas Weber, Utaemon Toyota
# PROJECT
Softwareprojekt WS2018/19
Betreuerin: Prof. Dr. Anette Frank
Graph Embedding Propagation
# Cora Node Classification
To evaluate the trained graph and the embeddings the task of node classification will be executed. First, the data of cora will be imported into a networkX graph, which will be saved in a pickle file to use it for the training of the embeddings with our EP-SP algorithm. Afterwards the trained embedding will be evaluated with LibLinear L2-Logistic Regression provided from sklearn.
# Required Data
- Cora dataset saved in cora_data for building the graph
- Embeddings for node classification
# Dependencies
For cora.py
-networkx for building the graph
-numpy to save one-hot vocabulary vectors
-pickle to save data in a pickle file
For node_classification.py
-cora.py
-random_nodes_for_node_classification.py for getting random node sets for test, training and validation sets
-pickle
-numpy
-sklearn for evaluation
-random for getting random test, trainings and validation sets
-sys
-argparse
-heapq for getting a heatmap from confusion matrix
Getting a networkx graph from Cora. Graph can be saved in txt file. CARE: numpy-arrays are converted to lists due to errors (NumPy array is not JSON serializable).
Initialize Embeddings for n dimensions with initialize-module.
Arrays are initialized in normal or uniform random format (default = normal).