Network Statistics
2.2M
Total Nodes
14.6M
Total Edges
6.5
Avg Degree
Blogging
Category
Size Relative to Repository Maximum
Nodes
2.2M
Edges
14.6M
Nodes & Edges โ Repository Comparison
Highlighted bar = this dataset. Logarithmic scale.
Edge-to-Node Ratio
Network density indicator
Dataset Details
Source
Reza Zafarani*, William D. Cole*, Huan Liu*
Dataset Information
2 files are included:
1. nodes.csv
-- it's the file of all the users. This file works as a dictionary of all the users in this data set. It's useful for fast reference. It contains
all the node ids used in the dataset
2. edges.csv
-- this is the friendship network among the user. The user's friends are represented using edges. Here is an example.
1,2
This means user with id "1" is friend with user id "2".
1. nodes.csv
-- it's the file of all the users. This file works as a dictionary of all the users in this data set. It's useful for fast reference. It contains
all the node ids used in the dataset
2. edges.csv
-- this is the friendship network among the user. The user's friends are represented using edges. Here is an example.
1,2
This means user with id "1" is friend with user id "2".
Attribute Information
This is the data set crawled on July, 2010 from LiveJournal ( http://www.livejournal.com ).
This contains the friendship network crawled. For easier understanding, all the contents are organized in CSV file format.
-. Basic statistics
Number of users : 88,784
Number of friendship pairs: 4,186,390
This contains the friendship network crawled. For easier understanding, all the contents are organized in CSV file format.
-. Basic statistics
Number of users : 88,784
Number of friendship pairs: 4,186,390
Relevant Papers
Reza Zafarani, William D. Cole, and Huan Liu. "Sentiment Propagation in Social Networks: A Case Study in LiveJournal", Advances in Social Computing: Third International Conference on Social Computing, Behavioral Modeling, and Prediction, SBP 2010, Bethesda, MD, USA, March 30-31, 2010, Proceedings, pp. 413-420.
How to Cite
If you publish material based on data from this repository, please acknowledge the Data Lab Social Computing Data Repository at Syracuse University in your acknowledgements. This helps others find and replicate your work.
APA Format
R. Zafarani and H. Liu. (2026). Social Computing Data Repository [https://datasets.syr.edu]. Data Lab, Syracuse University.
@misc{Data Lab:SU,
author = {R. Zafarani and H. Liu},
year = {2026},
title = {Social Computing Data Repository},
url = {https://datasets.syr.edu},
institution = {Data Lab, Syracuse University}
}
author = {R. Zafarani and H. Liu},
year = {2026},
title = {Social Computing Data Repository},
url = {https://datasets.syr.edu},
institution = {Data Lab, Syracuse University}
}