Netdimes project
We want to look at the growth of Autonomous Server (AS) networks found in the netdimes project to see how a network grows over time. In particular, we are looking at nodes in the year 2011. First, let us load the data we have downloaded:
In [8]:
Size
Easy stuff first: let’s look at how the size of the network grows over time. Note that this time period is just one year.
In [9]:
<matplotlib.legend.Legend at 0xd21c518>
So it appears as if the size of the AS network id more or less decreasing throughout the year. This is interesting, as I’d expect 2011 to be a year of strong growth of the internet. Nonetheless, this is server networks we are looking at, and 10 months is also a short timeframe.
Degree Distributions
We are going to use networkx
to assemble the internet network.
We will do this first for one month of the dataset:
In [10]:
<matplotlib.legend.Legend at 0x1209d7f0>
We can zoom in at the middle part:
In [11]:
The graphs show that there is a general shift towards the lower left part of the curve as the months go by. This is consistent with the observed decrease in size of the networks. At the same time, the degree distributions of the networks still exhibit power law behavior, which is expected: the degree distribution of the network is a manifestation of the real world behavior of how servers connect to each other.
Clustering coefficient
Let us look at the clustering coefficient. This metric will give us an idea how interconnected nodes are as the year progresses.
In [12]:
<matplotlib.legend.Legend at 0xa7c1748>
Once again, there is a decreasing trend. Since this is a CCDF, we can say that there are a lot more nodes with low clustering coefficients although nodes are decreasing. This might imply that the nodes being removed are those which have a low tendency to form clusters. It makes sense, as you can think of important nodes have a higher tendency to be a part of clusters, and thus servers that are removed in the network are those that are less important.