- Introduction: Node classification in networks
- Airport classification Using NetworkX
- Twitch user classification using PyTorch Geometric (PyG)
- Exercise: Classification of research papers in a citation network
- Solution 1: With NetworkX
- Solution 2: With PyG
- Supplementary material: Import a network in PyG
- Understand the task of node classification in networks
- Learn how to prepare network data for node classification tasks
- Learn how to perform node classification using NetworkX
- Learn how to perform node classification using deep learning
- Learn how to evaluate node classification results
This tutorial is aimed at social scientists and data scientists who are interested in analyzing network data. The tutorial is suitable for beginners and intermediate learners with some knowledge in Python and supervised machine learning, such as classification tasks.
Node classification is a common task in social network analysis, aiming to predict the labels of nodes within a network using its structure and properties. Node classification is essential in various applications in computational social science, including social network analysis, role analysis, and trust analysis. For example, in a citation network, we can classify research papers into different research fields. In a social network, we can classify users into different groups based on their social interactions.
The tutorial is designed to be completed in 4-5 hours. The duration may vary depending on the prior knowledge of the participants. The exercises are optional and can be done after the tutorial, which may take 2-3 hours. The entire content can be covered in a full-day workshop.
The tutorial can be run on a laptop or desktop computer. A CPU is sufficient for completing the content in the tutorial and exercises. For large datasets and deep learning models, a GPU is recommended.
Python >= 3.9 is required. In addition, the necessary packages are listed in requirements.txt. Using pip
as an example, install the packages with
pip install -r requirements.txt
Run prep.sh
to download some datasets that are used in the tutorial. Alternatively you can download them manually during the tutorial.
./prep.sh
Dr. Jun Sun @ GESIS