| Notation | Meaning |
| --- | --- |
| u | Account |
| p | Percentage of replies; relatively low for astroturfing |
| P | Probability that a word appears; also the joint probability |
| s | Consecutive sentence |
|  | Semantic vector representation |
| k | Piece containing a given sentence; K is the total number of pieces |
| l | Total number of sentences in the piece |
|  | Following similarity between two accounts |
| F | Set of original tweets of the account u |
|  | Low followers; relatively high for astroturfing |
|  | Retweet similarity between two accounts |
|  | Retweeting time and retweet ID of a retweet |
|  | Percentage of the most dominant application |
|  | Posting frequency; relatively high for astroturfing |
|  | Number of received clicks; relatively low for astroturfing |
|  | Transition probability from one word to another |
|  | Sentence transition; relatively high for astroturfing |
|  | Word co-occurrence |
|  | Average score of two consecutive sentences |
|  | Best score of two consecutive sentences |
|  | Pairwise sentence similarity |
| SD | Semantic dispersion; relatively high for astroturfing |