C#代写|Gui代写|数据分析: 这是一个利用C#实现paper中的数据分析界面的代写
Abstract
Social Networks now become popular and powerful
platforms for people to share information. Everyone
may share their interested information with
their connections, or send messages to their friends.
However, sharing information costs both computational
and communicational resources, in addition
to personal time/attention of both the sender
and the receiver. Decision-making regarding which
piece of information should be shared with whom,
thus is important to individuals and the whole network.
In this work, we study the effects of different
information-sharing strategies using a social network
simulator framework. This paper describes
how social network is modeled, and the various factors
relevant to the sharing decisions. We propose
six information sharing strategies, and performed
simulation experiments to examine their influences
on individuals and the whole social network.
keywords: Social Networks, Information Sharing, Strategies,
Simulation.
1 Introduction
Social network here refers to a group of people connected
through Internet via emails, websites, and social media such
as Facebook, Twitter, blogs, and WeChat. Social Networks
have become increasingly popular and fast-growing platforms
for information sharing, job searching and product
marketing [Hogan, 2008; Ellison, 2007]. Information propagates
fast in social network through direct personal social
connections. There are various ways for sharing information,
including direct email message, forwarding a post to some or
all connections, recommending a post/blog/product to some
or all connections, posting/blogging on personal website allowing
a certain group of people to view, etc. Each method
works differently in term of the targeted audience and the implicitly
required degree of attention. A receiver of a direct
email feels more obliged to read the message and reply than
a receiver of an automatically-generated notification of a new
post by a friend.
∗
Given the six degree of separation, small world theory
[Travers and Milgram, 1969], and the fast communication
speed of internet, the information propagated in social network
may reach a very large population within a very short
period of time [Kim et al., 2010]. However, this powerful
and speedy circulation does not come free. The computational
and communicational costs are obvious, the sharing of
information increases the usage of computational resources
and generates large amount of internet traffics. Other lessobvious
costs include the personal time and attention of both
the sender and the receivers. Additionally, the increasing
amount of information reduces the receiver’s attention to any
particular piece of information, hence diminishes the influence
of the shared information or even damages the social
trust between the receiver and the sender. Therefore, it is important
to share the right information with the right person.
However, this is not an easy task if an individual has to
decide that with whom to share for each piece of received
information. The goal of this study is to develop automatic
decision-making mechanisms to help each individual in a social
network. The automatic decision-making is based on
the information relevance and the interest profile of each
connected individual. Each piece of information can be either
manually or more realistically, automatically classified
for its relevance to different categories or subjects, with the
help of context analysis and text mining tools [Aggarwal and
Zhai, 2013], or structured meta-level data such as semantic
web [Gruber, 2008]. On the other hand, each individual
may truthfully describe one’s interested subjects in one’s
published interest profile, given the motivation to reduce undesired
information. The focus of this work is to study different
information-sharing strategies, assuming both information
relevance and interest profile are available.
The uniqueness of this work lies in the aspect of viewing
each node as an intelligent individual and be able to make informed
decision. In traditional research on information propagation
/diffusion, some mathematic models such as Linear
Threshold Model and Independent Cascade Models [Kempe
et al., 2003; Guille et al., 2013] are used to describe the diffusion
process. With such models, each individual is simply
a data object [Smith et al., 2009] behaved according to
a fixed protocol and/or an pre-set attribute value, without
freedom to make its own decision on which information to
share with whom. In recent work on strategic networks with
self-interested agents [Zhang and van der Schaar, 2013], each
node is a strategic agent who benefits from producing and disseminating
information. This is similar to the model used in
our work: each individual may choose different informationsharing
strategy according to its own goal and preference.
This work studies the impact of different information-sharing
strategies not only on the whole network, but also on each
individual in the network. Such decision affects the amount
of information shared, the attention level of the receivers, the
propagation in the network and also the network connection
structure: too many un-appreciated information may cause
disconnection. Individual behavior also has impact on computational
and communication cost, which is a concern for
the society and also should be a concern for everyone, as the
environmental issue.
[Hudack et al., 2014] has studied information diffusion
with game theory. [Horel and Singer, 2015] describes an
adaptive information dissemination method that selects users
aiming to target their influential neighbors. However, our
study has different focus and model than these work.
To simplify the description, in the rest of the paper, we use
message to refer to all types of information one may share
in social network, including post, blog, video, pins, etc. To
send message or forward message means to share a piece of
information with the receiver. In current work, we assume
there is only one method of sharing information; in the future
we will study different sharing methods.
In Section 2 we describe the graph model of social network.
We then present the representation of information relevance
and interest profile of individuals in Section 3. Six
different information-sharing strategies are proposed in Section
4, and the network structure parameters and the experiments
are reported in Section 5. In Section 6 we define
four evaluation criteria for these strategies, and experimental
results are reported in Section 7. Finally we discuss our
conclusion and future work in Section 8.
2 Social Network Structure and Modeling
In this work, we model social network as a graph, with Node
representing individual person, and Edge (link) representing
connection between two persons. In this work, we model social
network as undirected graph, and the connections represent
symmetric relationships, such as friend relationship.
However, some connections in social network are asymmetric,
i.e. following relationship; these asymmetric relations
can be modeled as directed links in future study.
Degree of a node measures the number of connections of
this node. In a social network, nodes may have significantly
different degree: some persons have a lot of connections
while some may only have a few. A cluster, or community
is a group of nodes with many connecting edges between
them, and there are relatively few connecting edges between
nodes that belong to different clusters [Fortunato, 2010;
Golbeck, 2013]. Inside each cluster, a leader node is the one
who has the most number of connections within this cluster.
A leader node may serve as an information hub that connects
to other clusters.
Number of clusters and the size of each cluster have big
Figure 1: A network with 27 small clusters of size 10, 6
medium clusters of size 33, and 2 large clusters of size 57.
influence on the propagation of information in the social network
[Golbeck, 2013]. Hence we choose the following parameters
to characterize the network structure in this work:
1. Number of small clusters, and the size (number of
nodes) of a small cluster.
2. Number of medium clusters, and the size of a medium
cluster.
3. Number of large clusters, and the size of a large cluster.
We build a simulator that takes the above six parameter values
as input and create a network accordingly. Figure 1 shows
a network with 27 small clusters of size 10, 6 medium clusters
of size 33, and 2 large clusters of size 57. With the repelling
property of the nodes in D3 force directed graphs [Kobourov,
2012], nodes in the same cluster attract each other while those
in different clusters repel each other.
3 Information Relevance and Individual
Interest Model
To describe how much an individual in social network is interested
in a piece of information, we introduce the following
model.
Assume there are x categories (subjects) being modeled in
this framework: c1, c2, …, cx, each message m is associated
with a Category List CLm:
CLm = {(cm1, rm1), · · · ,(cmi, rmi), · · · ,(cmx, rmx)}
cmi is the ith category that message m is relevant to, and
Relevance Factor rmi ∈ [0, 1] describes how strong message
m is relevant to category cmi, where 1 stands for the strongest
relevance and 0 means no relevance at all.
For example, a message ma about how to choose running
shoes has a category list as CLma
shown in Table 1, and another
message mb on some health diet and exercise suggestions
for losing weight has a category list as CLmb
shown in
Table 1: Examples of Information Relevance and Individual Interest Model
Relevance of ma CLma {(F itness, 0.5),(Shoes, 0.8)}
Relevance of mb CLmb
{(F itness, 0.4),(Diet, 0.8),(W eightControl, 1.0),(Health, 0.7)}
Interest Profile of np F Lnp {(F itness, 1, 0.6),(Diet, 3, 0.9),(Shoes, 2, 0.9),(Health, 4, 0.8)}
Interest Set of CLma
and F Lnp
ISma,np {F itness, Shoes}
Interest Set of CLmb
and F Lnp
ISmb,np
{F itness, Diet, Health}
Size of Interest Set | ISma,np
| 2
Size of Interest Set | ISmb,np
| 3
Set of Interest Factors I(ma, np) {1, 2}
Set of Interest Factors I(mb, np) {2, 3, 4}
Set of Relevance Factors R(ma, np) {0.5, 0.8}
Set of Relevance Factors R(mb, np) {0.4, 0.8, 0.7}
Average Interest Factor Value Ia(ma, np) 1.5
Average Interest Factor Value Ia(mb, np) 3
Average Relevance Factor Value Ra(ma, np) 0.65
Average Relevance Factor Value Ra(mb, np) 0.63
Average Relevance Threshold Value RTa(ma, np) 0.75
Average Relevance Threshold Value RTa(mb, np) 0.77
Table 1 as well. Such category list can be automatically generated
with natural language processing tools [Aggarwal and
Zhai, 2013].
On the other hand, each individual n has a profile describing
one’s interest, represented as an Interest Factor List:
F Ln = {(cn1, fn1, fnr1), · · · ,(cnj , fnj , fnrj ), · · · ,
(cnx, fnx, fnrx)}
cnj is the jth category that node n is interested in, interest
factor fnj represents how interested node n is in category cnj .
fnj is an integer in the range of [0, 5]. Relevance threshold
fnrj is the minimum value of the relevance factor in category
cnj for a message to be considered as relevant by node
n. For example, an individual np with interest profile F Lnp
shown in Table 1, is interested in four categories: Fitness,
Diet, Shoes and Health. A message must has a relevance factor
value no less than 0.6 for Fitness, or 0.8 for Diet, 0.9 for
Shoes, or 0.8 for Health to be considered relevant by this individual,
to each category respectively.
Each individual in social network may set up his/her interest
profile to describe which categories one is interested and
how much interest one has. One may also adjust the relevance
threshold value dynamically based on the amount of
information one receives and one’s tolerance at that time.
Given a message m with category list CLm, and an individual
node n, Interest Set ISmn is the set of categories that
both message m is relevant to and also node n is interested
in. More formally stated:
ISmn = {c|∃i, cmi ≡ c ∧ rmi > 0 ∧ ∃j, cnj ≡ c ∧ fnj > 0}
Given a message m and an individual node n, the following
parameters are defined based on this intersection set ISmn:
1. size(ISmn) = | ISmn |, number of categories inside
ISmn.
2. I(m, n) = {fnj |cnj ∈ ISmn} the set of the interest
factors, each for one category cnj in ISmn.
3. R(m, n) = {rmi|cmi ∈ ISmn} the set of the relevance
factors, each for one category cmi in ISmn.
0
500
1000
1500
2000
2500
3000
Network1
Network2
Network3
270
1000
2000
198
500
500
114
500
500
#Nodes
Network	Structures
Small	Cluster	 Med.	Cluster	 Large	Cluster
Figure 2: Three Network Structures
4. Ia(m, n) =
X
cnj∈I(m,n)
fnj
size(ISmn)
, the average interest factor
value.
5. Ra(m, n) =
X
cmi∈I(m,n)
rmi
size(ISmn)
, the average relevance factor
value.
6. RTa(m, n) =
X
cmi∈I(m,n)
fnrj
size(ISmn)
, the average relevance
threshold value.
Table 1 shows the above parameter values given example
message ma , ma and individual np.
4 Information-Sharing Strategies
Based on the information relevance and individual interest
model described in Section 3, we propose six informationsharing
strategies, described below.
Table 2: Network Structure Information and Experimental Parameters
# Small Size # Med. Size # Large Size # Seed # Time
Network Clusters Small Clusters Med. Clusters Large #Nodes Messages Steps
#1 27 10 6 33 2 57 582 89 233
#2 100 10 20 25 10 50 2000 301 800
#3 200 10 20 25 5 100 3000 489 1300
• Strategy 1 Even Little Interested (ELI). Send message
m to node n if there exists at least one common category
in m’s category list and also in n’s interest profile, with
a relevance of factor value no less than 0.1. | ISmn |≥ 1
∧ min(R(m, n)) ≥ 0.1
Using Strategy 1 (ELI), both message ma and mb should
be sent to node np.
• Strategy 2 Average Interest in message (AI). Send
message m to node n if the average interest factor of
all categories in ISmn is no less than 3 and the average
relevance factor value of all categories in ISmn is no
less than 0.3.
Ia(m, n) ≥ 3 ∧ Ra(m, n) ≥ 0.3
Using Strategy 2 (AI), mb should be sent to node np but
message ma should not be sent to np.
• Strategy 3 High Interested and Relevance (HIR).
Send a message m to node n if n is very interested
in any category in ISmn, or the average interest factor
Ia(m, n) is no less than 3 and the average relevance factor
value Ra(m, n) is no less than the average relevance
threshold value RTa(m, n).
max(I(m, n)) = 5 or ( Ia(m, n) ≥ 3 ∧ Ra(m, n) ≥
RTa(m, n) )
Using Strategy 3 (HI), neither ma nor mb should be sent
to np.
• Strategy 4 Unless Not Interested (UNI). Send message
m to node n if there is at least one common category j
in m’s category list and also in n’s interest profile.
| ISmn |≥ 1
Using Strategy 4 (UNI), both message ma and mb
should be sent to node np.
• Strategy 5 Combined Interest and Relevance (CIR).
Send message m to node n if the combined average of
the average interest factor value (in percentage) and the
average relevance factor value (in percentage) is no less
than 50%.
Ia(m,n)
5 +Ra(m,n)
2 ≥ 50%
Using Strategy 5 (CIR), mb should be sent to node np
but message ma should not be sent to np.
• Strategy 6 Moderate Interest and Relevance (MIR).
Send message m to node n if if there is at least one common
category j in ISmn that node n’s interest factor for
j is no less than 3 and message m’s relevance factor for
j is no less than 0.5.
∃j ∈ ISmn, fnj ≥ 3 ∧ rmj ≥ 0.5
Using Strategy 6 (MIR), mb should be sent to node np
but message ma should not be sent to np.
5 Network Structure and Experiment Set up
Network structure may has significant influence on information
propagation in the network. In this study, we conducted
experiments with three networks using parameters shown in
Table 2. Network 1 has 27 small clusters with10 nodes each, 6
medium clusters with 33 nodes each, and 2 large clusters with
57 nodes each. In total, Network 1 has 582 nodes, Network
2 has 2000 nodes and Network 3 has 3000 nodes. Figure 2
illustrates the different structures of these three networks. We
created a pool of 10000 randomly-generated interest profiles,
each with random interest factor values and random relevance
threshold values. When a network is created, the specified
number of nodes are generated, each node is associated with
an interest profile drawn from this pool.
In each experiment with a given network, multiple (about
15% of the total number of nodes) seed messages are created,
each with a randomly-generated category list. The experiment
is running with synchronized simulation time steps. At
each time step, one new seed message is delivered to a randomly
selected node in the network. At each time step, each
node makes decision for each message received in the previous
time step. The decision includes whether to share this
message to its connected nodes, and which nodes to share
with, using its information-sharing strategy. This process
continues until a pre-set number of time steps is reached,
which is about 40% of total number of nodes. Table 2 also
reports these experiment parameter values. For example, 489
seed messages were created for Network 3 and the experiment
runs for 1300 time step.
6 Evaluation Criteria
To evaluate the influence of the information-sharing strategy
on individuals and also on the social network, we define the
following criteria.
• Interest Ratio, measures how many messages are interesting
to a node out of all its received contents.
Interest Ratio(n) for node n, is calculated as:
#Interesting(High,Med.,Low) Messages Received by n
#Received Messages of n
Depending on the degree of interest, three measures are
defined as:
– High Interest Ratio: the ratio of highly interesting
messages, with maximum interest factor value
max(I(m, n)) as 5.
– Medium Interest Ratio: the ratio of medium interesting
messages, with maximum interest factor
value max(I(m, n)) as 3 or 4.
0.21
0.13
1
0.21
0.3	 0.35
0.4	 0.61
0
0.39
0.46
0.64
0.38
0.25
0.38
0.23
0
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
S1_ELI	 S2_AI	 S3_HIR	 S4_UNI	 S5_CIR	 S6_MIR
Interest	Ra?o:		Network	1
High_Int/R	 Med_Int/R	 Low_Int/R
(a) Network 1
0.2
0.12
1
0.2
0.29	 0.33
0.39
0.63
0
0.39
0.45
0.66
0.39
0.24
0.39
0.25
0
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
S1_ELI	 S2_AI	 S3_HIR	 S4_UNI	 S5_CIR	 S6_MIR
Interest	Ra?o:		Network	2
High_Int/R	 Med_Int/R	 Low_Int/R
(b) Network 2
0.2
0.12
1
0.2
0.29	 0.33
0.4	 0.62
0
0.39
0.45
0.66
0.39
0.24
0.38
0.24
0
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
S1_ELI	 S2_AI	 S3_HIR	 S4_UNI	 S5_CIR	 S6_MIR
Interest	Ra?o:		Network	3
High_Int/R	 Med_Int/R	 Low_Int/R
(c) Network 3
Figure 3: Interest Ratio: #Interesting(High,Med.,Low)Messages
#ReceivedMessages , Three Networks
0.93
0.12
1	 1	 1
0.59
0.95
0.32
0
1
0.81
0.63
0.93
0.15
0
1
0.42
0
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
S1_ELI	 S2_AI	 S3_HIR	 S4_UNI	 S5_CIR	 S6_MIR
Reachability	RaBo	:	Network	1
Rcvd/High_Int	 Rcvd/Med_Int	 Rcvd/Low_Int
(a) Network 1
0.92
0.12
1	 1	 1
0.58
0.94
0.35
0
1
0.78
0.63
0.93
0.15
0
1
0.42
0
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
S1_ELI	 S2_AI	 S3_HIR	 S4_UNI	 S5_CIR	 S6_MIR
Reachability	RaBo	:	Network	2
Rcvd/High_Int	 Rcvd/Med_Int	 Rcvd/Low_Int
(b) Network 2
0.91
0.12
1	 1	 1
0.59
0.92
0.34
0
1
0.79
0.64
0.92
0.15
0
1
0.42
0
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
S1_ELI	 S2_AI	 S3_HIR	 S4_UNI	 S5_CIR	 S6_MIR
Reachability	RaBo	:	Network	3
Rcvd/High_Int	 Rcvd/Med_Int	 Rcvd/Low_Int
(c) Network 33
Figure 4: Reachability Ratio: #NodesReceivedMessages
#NodesInterested(High,Med.,Low)inMessages , Three Networks
– Low Interest Ratio: the ratio of low interesting
messages, with maximum interest factor value
max(I(m, n)) no more than 2.
The interest ratio for the whole social network is measured
as the average interest ratio of all nodes in this
network.
• Reachability Ratio, measures how many individuals actually
receive the message that they are interested in out
of all individuals who are interested in this message.
Readability Ratio(m) for a seed message m is calculated
as:
#Nodes Interested(High,Med.,Low) in m and Received m
#Nodes Interested(High,Med.,Low) in m
The reachability ratio for one experiment is the average
reachability ratio of all seed messages generated in this
experiment.
• Appreciation Ratio, measures how many messages
forwarded by a node s are appreciated by the receiver
node out of the total number of messages forwarded
by this sender node s. A message m is appreciated
by a receiver node n, if there exist a category j in
ISmn that the receiver’s interest factor fnj is no
less than 3 and the message’s relevance factor rmj
is no less than the receiver’s relevance tolerance fnj .
Appreciation Ratio(s, n) for a sender node s by
receiver node n is calculated as:
#Messages F orwarded by s and Appreciated by Receivern
#Messages F orwarded by s
Appreciation Ratio(s) for a sender node s is the average
appreciation ratio value by all its connected nodes.
In one experiment, the appreciation ratio of each node is
calculated, the average appreciation ratio of all nodes in
this network is measured too. In addition, all nodes in
this network are classified into three categories according
to their appreciation ratio values: above 0.6, between
0.3 and 0.5, below 0.3. Results are reported in Section
7.
• Message Node Ratio, measures the ratio of the total
number of messages to the total number of nodes in
the network, approximately the average number of messages
received by each node during the entire experiment
period. The Message Node Ratio relates to the cost
associated with each message forwarded in the network.
7 Simulation Results
Using the experimental set up described in Section 5, we
conducted 18 experiments with the six information-sharing
strategies proposed in Section 4 and three networks described
in Section 5. Each experiment is conducted with one of the
three networks, and one of the six strategies, which is used
by all nodes in the network. We collected all those measurements
defined in Section 6.
Figure 3 shows the comparisons of Interest Ratio for all six
strategies and for all three networks, Network 1, 2 and 3 from
0.00
0.73	 0.63
0.07	 0.06
0.95
0.66
0.18	 0.36
0.62
0.88
0.05
0.34
0.09	 0.01
0.31
0.05	 0.00
0.32
0.45
0.6
0.32
0.46
0.75
0%
10%
20%
30%
40%
50%
60%
70%
80%
90%
100%
S1_ELI	 S2_AI	 S3_HIR	 S4_UNI	 S5_CIR	 S6_MIR
Apprecia@on	Ra@o	and	Classifica@on	of	Nodes,
Network	1
AI/FW	>=0.6	 AI/FW	0.3-0.5	 AI/FW	<0.3	 AI/FW	
(a) Network 1
0.00	
0.52	 0.60	
0.00	 0.07	
0.95	
0.73	
0.43	 0.37	
0.64	
0.89	
0.05	
0.27	
0.04	 0.03	
0.36	
0.04	 0.00	
0.32
0.46
0.62
0.32
0.46
0.76
0%	
10%	
20%	
30%	
40%	
50%	
60%	
70%	
80%	
90%	
100%	
S1_ELI	 S2_AI	 S3_HIR	 S4_UNI	 S5_CIR	 S6_MIR	
Apprecia@on	Ra@o	and	Classifica@on	of	Nodes,	
Network		2	
AI/FW	>=0.6	 AI/FW	0.3-0.5	 AI/FW	<0.3	 AI/FW	
(b) Network 2
0.02	 0.09	
0.63	
0.02	 0.08	
0.72	 0.80	
0.85	
0.35	
0.68	
0.89	
0.26	 0.15	
0.06	 0.02	
0.30	
0.03	 0.05	
0.33
0.46
0.62
0.32
0.47
0.75
0%	
10%	
20%	
30%	
40%	
50%	
60%	
70%	
80%	
90%	
100%	
S1_ELI	 S2_AI	 S3_HIR	 S4_UNI	 S5_CIR	 S6_MIR	
Apprecia@on	Ra@o	and	Classifica@on	of	Nodes,	
Network	3		
AI/FW	>=0.6	 AI/FW	0.3-0.5	 AI/FW	<0.3	 AI/FW	
(c) Network 3
Figure 5: Appreciation Ratio and Classification of Nodes Accordingly, Three Networks
70	
22	 30	
73	 60	 36	
227	
77	
106	
234	
191	
124	
398	
142	
186	
414	
342	
226	
0	
100	
200	
300	
400	
500	
600	
700	
800	
S1_ELI	 S2_AI	 S3_HIR	 S4_UNI	 S5_CIR	 S6_MIR	
#Messages/#Nodes	
Network1	(582)	 Network2	(2000)	 Network3	(3000)	
Figure 6: Message Node Ratio: #T otalMessages
#Nodes
left to right. Three measurements, High Int/R, Med Int/R,
Low Int/R, each represents the number of High, Medium and
Low interesting messages out of total number of received
messages, respectively. These three ratios add up to 1 by
definition. S3 HIR has High Int/R as 1, and S6 MIR has
Low Int/R as 0, both facts are consistent with how these
strategy work. Both S2 AI and S5 CIR have Low Int/R no
more than 0.25 (1/4), while both S1 ELI and S4 UNI have
Low Int/R greater than 0.33 (1/3). These facts are consistent
over all three networks.
Figure 4 shows the comparisons of Reachability Ratio.
Three measurements, Rcvd/High Int, Rcvd/Med Int, and
Rcvd/Low Int represent the ratio of the nodes who actually
received the message among all nodes that are High, Medium
or Low Interested in the message, respectively. S1 ELI,
S3 HIR, S4 UNI, and S5 CIR all have Rcvd/High Int close
to or equal to 1, meaning that they reach almost all nodes
that are highly interested in the message. In fact, S1 ELI and
S4 UNI reach almost all nodes that are interested in the message.
S5 CIR reaches about 80% medium interested nodes
and 40% medium interested nodes. S6 MIR has moderate
reachability in high and medium interested nodes as around
0.6, and it has 0 reachability among low interested nodes.
S2 AI has the lowest reachability, it only reaches about 12%
high interested nodes, 34% medium interested nodes and 15%
low interested nodes. Again, these observations are consistent
over all three networks.
Figure 5 shows the comparisons of Appreciation Ratio and
the classification of nodes according to their appreciation ratio.
S6 MIR has the highest average appreciation ratio as
≈ 0.75, followed by S3 HIR as ≈ 0.6, then S2 AI and
S5 CIR with ≈ 0.45. S1 ELI and S4 UNI have the lowest
appreciation ratio as ≈ 0.32. S6 MIR also has the highest
ratio of highly appreciated nodes, as 0.95 for Network 1 and
2, 0.80 for Network 3. S3 HIR has a consistent ≈ 0.60 ratio
of highly appreciated nodes over all three networks.
Some of the above facts are obvious given the definition
of the strategy, others are not. Most observations are consistent
over all three different network structures. Figure 6
presents the average number of messages received by each
node. Consider the communicational and computational cost
of information-sharing, S2 AI has the lowest cost, followed
by S3 HIR and then S6 MIR. The three high cost strategies
are S5 CIR, S1 ELI and S4 UNI being the most costly. Also
noted that the message/node ratio increases significantly as
the size of network increases, which can be explained by the
fact that the possible number of connections is the square of
the number of nodes in the network. Therefore, the choice of
information-sharing strategy becomes even more important
for large social networks.
Overall, S3 HIR has low cost, high interest ratio, perfect
reachability among highly interested node while zero among
other nodes, and moderate appreciation ratio ≈ 0.6. S6 MIR
has low cost, moderate interest ratio (no low interest message
delivered), and the highest appreciation ratio. S2 AI has
the lowest cost and moderate appreciation ratio, however, its
reachability is very low.
8 Conclusion and Future Work
In this paper we presented a graph model of social network
and a model of information relevance and node interest.
Based on these models, we proposed six information-sharing
strategies and defined a set of evaluation criteria including the
interest degrees, reachability, appreciation degrees and cost.
We conducted experiments to study the performance of each
strategy in three different networks. Some of the observations
are intuitive given how the strategies work, which in fact verify
that the simulation framework works correctly. In the future
we will study more realistic scenario, where each node
may choose different strategy and even dynamically change
its strategy responding to its environment , i.e. the number
of messages it receives. A node may also choose to different
response to received message depending on its source. We
also plan to model the real communication cost and computational
cost as a function of the number of messages in the
network, in order to study the performance of each strategy in
various settings. Intuitive conclusions rarely can be achieved
in such complicated setting, this experimental study framework
will be indeed appreciated. In the future, we will also
like to explore different methods of sharing information besides
sending messages, and study applying different strategies
with different sharing methods.
References
[Aggarwal and Zhai, 2013] Charu C Aggarwal and ChengXiang
Zhai. Mining text data. Springer Science & Business
Media, 2013.
[Ellison, 2007] Nicole B Ellison. Social network sites: Definition,
history, and scholarship. Journal of ComputerMediated
Communication, 13(1):210–230, 2007.
[Fortunato, 2010] Santo Fortunato. Community detection in
graphs. Physics reports, 486(3):75–174, 2010.
[Golbeck, 2013] Jennifer Golbeck. Analyzing the Social
Web. Elsevier Inc., 2013.
[Gruber, 2008] Tom Gruber. Collective knowledge systems:
Where the social web meets the semantic web. Web semantics:
science, services and agents on the World Wide
Web, 6(1):4–13, 2008.
[Guille et al., 2013] Adrien Guille, Hakim Hacid, Cecile ´
Favre, and Djamel A Zighed. Information diffusion in online
social networks: A survey. ACM SIGMOD Record,
42(2):17–28, 2013.
[Hogan, 2008] Bernie Hogan. Analyzing social networks.
The Sage handbook of online research methods, page 141,
2008.
[Horel and Singer, 2015] Thibaut Horel and Yaron Singer.
Scalable methods for adaptively seeding a social network.
In Proceedings of the 24th International Conference on
World Wide Web, pages 441–451. International World
Wide Web Conferences Steering Committee, 2015.
[Hudack et al., 2014] Jeffrey Hudack, Nathaniel Gemelli,
and Jae C. Oh. Modeling self-interested information diffusion
with game theory on graphs. In ICAART (2), 2014.
[Kempe et al., 2003] David Kempe, Jon Kleinberg, and Eva ´
Tardos. Maximizing the spread of influence through a social
network. In Proceedings of the ninth ACM SIGKDD
international conference on Knowledge discovery and
data mining, pages 137–146. ACM, 2003.
[Kim et al., 2010] Won Kim, Ok-Ran Jeong, and Sang-Won
Lee. On social web sites. Information Systems, 35(2):215–
236, 2010.
[Kobourov, 2012] Stephen G. Kobourov. Spring embedders
and force directed graph drawing algorithms. CoRR,
abs/1201.3011, 2012.
[Smith et al., 2009] Marc A Smith, Ben Shneiderman,
Natasa Milic-Frayling, Eduarda Mendes Rodrigues,
Vladimir Barash, Cody Dunne, Tony Capone, Adam Perer,
and Eric Gleave. Analyzing (social media) networks with
nodexl. In Proceedings of the fourth international conference
on Communities and technologies, pages 255–264.
ACM, 2009.
[Travers and Milgram, 1969] Jeffrey Travers and Stanley
Milgram. An experimental study of the small world problem.
Sociometry, 32(4):425–443, Dec. 1969.
[Zhang and van der Schaar, 2013] Y. Zhang and M. van der
Schaar. Strategic networks: Information dissemination
and link formation among self-interested agents.
IEEE Journal on Selected Areas in Communications,
31(6):1115–1123, June 2013.