Machine Learning Times
10 years ago
Top LinkedIn Groups in 2014 for Analytics, Big Data, Data Mining, and Data Science

 

We analyze the top 30 LinkedIn groups for Analytics, Big Data, Data Mining, and Data Science. Overall activity dropped about 25%, but membership growth accelerated in Q4 2013. We identify 4 group quadrants and find which groups are the fastest growing and most active.

We update our analysis of Top 30 LinkedIn Groups for Analytics, Big Data, Data Mining, and Data Science (Dec 2013) and find several interesting trends.

First, we found that growth slowed down in 2013 Q3 but resumed in 2013 Q4 and 2014 Q1.

Figure 1 (below) shows quarterly growth rates in the top 30 groups. Two groups, Machine Learning and SAS & Analytics Users (not shown in Figure 1), had big growth in 1 or 2 quarters and none in the other 2. Most of the remaining groups show a surprisingly similar pattern: a decline in growth in 13Q3, followed by acceleration in 13Q4 and 14Q1.

Fig 1: Top Linked Analytics Groups, Quarterly Growth 2013Q2 to 2014Q1. Thick black line is the overall average growth rate.

Here are the 10 largest groups (by membership as of March 31, 2014). The 7 largest are in the same order as in Nov 2013. The 6 largest grew significantly faster than the next 4 groups.

  • Advanced Business Analytics, Data Mining and Predictive Modeling: 121,816 (74% growth in 12 months)
  • Big Data / Analytics / Strategy / FP&A / S&OP / Strategic Planning / Predictive & Business Analytics Group: 95,638 (82% growth)
  • Big Data and Analytics: 74,350 (100%)
  • Business Analytics: 53,345 (43%)
  • Data Mining, Statistics, Big Data, and Data Visualization: 43,761 (116%)
  • BIG DATA Professionals – Architects Scientists Analytics Experts: 30,792 (92%)
  • Next Gen Market Research (NGMR): 23,368 (15%)
  • SAS Analytics & BI (closed): 20,941 (32%)
  • Business Intelligence & Analytics Group: 20,000 (4%)
  • Global Analytics Network: 19,389 (11%)

However, there seems to be no strong correlation between group size and growth rate among all 30 groups.
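One way to check this claim is to compute a correlation coefficient from the membership and growth figures in the table of groups in this post. The sketch below uses the Pearson coefficient; that choice is an assumption, since the post does not say which measure was used.

```python
from math import sqrt

# (members on Mar 31, 2014, 12-month growth in %) for the 30 groups,
# copied from the table in this post.
GROUPS = [
    (121816, 74), (95638, 82), (74350, 100), (53345, 43), (43761, 116),
    (30792, 92), (23368, 15), (20941, 32), (20000, 4), (19389, 11),
    (19087, 52), (16297, 60), (15121, 34), (14930, 28), (14929, 35),
    (13947, 27), (13112, 48), (12025, 66), (8450, 50), (8003, 22),
    (7623, 72), (7278, 41), (7052, 114), (6349, 22), (4972, 126),
    (4886, 73), (4034, 33), (3596, 18), (3594, 63), (3044, 42),
]


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


sizes = [size for size, _ in GROUPS]
growth = [pct for _, pct in GROUPS]
r = pearson(sizes, growth)
print(f"Pearson r between group size and growth: {r:.2f}")
```

Values of r near 0 indicate little linear relationship. Because group sizes are heavily skewed, a rank-based measure could give a different picture.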

Here are 10 groups with the fastest growth in the past 12 months (March 25, 2013 to March 31, 2014)

  • RDataMining: 126%
  • Data Mining, Statistics, Big Data, and Data Visualization: 116%
  • Data Scientists: 114%
  • Big Data and Analytics: 100%
  • BIG DATA Professionals – Architects Scientists Analytics Experts: 92.5%
  • Big Data / Analytics / Strategy / FP&A / S&OP / Strategic Planning / Predictive & Business Analytics Group: 82%
  • Advanced Business Analytics, Data Mining and Predictive Modeling: 74%
  • KDnuggets Analytics, Data Mining, and Data Science: 73%
  • Predictive Analytics Network (PAN): 72%
  • Advanced Analytics (closed): 66%

The chart below shows group growth vs group size. Color corresponds to age – redder is younger, bluer is older. Group name abbreviations are in the table below.

Fig 2: Top Linked Analytics, Big Data, Data Science Groups by 2014 size vs growth

There are 2 main measures of group activity: discussions (posts) per week and comments per week. Since these numbers clearly depend on the group size, we measure them per 1000 members. We measure overall group activity as (discussions + comments) per week per 1000 members.

For the 4 months ending in March 2014, the average activity level was 2.99 per week per 1000 members, about 25% less than the 3.97 per week measured in Nov 2013.
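The activity measure and the 25% drop can be reproduced with a few lines of arithmetic. This is a minimal sketch: the function name and the raw counts in the last line are hypothetical, while the four rates are the per-1000-member averages reported in this post.

```python
def activity_per_1000(discussions_per_week, comments_per_week, members):
    """Weekly discussions plus comments, scaled to 1000 members."""
    return (discussions_per_week + comments_per_week) / members * 1000


# Averages reported in this post, already expressed per 1000 members.
activity_mar_2014 = 2.03 + 0.96  # discussions + comments
activity_nov_2013 = 2.57 + 1.40

drop = (activity_nov_2013 - activity_mar_2014) / activity_nov_2013
print(f"{activity_mar_2014:.2f} vs {activity_nov_2013:.2f}: {drop:.0%} lower")

# Hypothetical raw counts: 20 discussions and 10 comments a week in a
# group of 5000 members.
print(activity_per_1000(20, 10, 5000))
```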

The chart below shows group activity vs group size. Color corresponds to age – redder is younger, bluer is older. Group name abbreviations are in the table below.

Fig 3: Top Linked Analytics, Big Data, Data Science Groups – 2014 Activity vs Growth

In the 4 months ending in March 2014, the average activity level was 2.03 discussions/week per 1K members and 0.96 comments/week per 1K members, or about 2.1 discussions per comment. Both rates are well below the 2.57 discussions/week and 1.40 comments/week per 1K members measured in Nov 2013 (1.8 discussions per comment). This means that while activity has slowed down, the gap between discussions and comments has widened.
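The discussions-per-comment ratios follow directly from the four rates quoted above. A small sketch, with a hypothetical helper name:

```python
def discussions_per_comment(discussion_rate, comment_rate):
    """Ratio of discussions to comments; both rates are per week per 1K members."""
    return discussion_rate / comment_rate


mar_2014 = discussions_per_comment(2.03, 0.96)
nov_2013 = discussions_per_comment(2.57, 1.40)
print(f"Mar 2014: {mar_2014:.1f}, Nov 2013: {nov_2013:.1f}")
```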

The chart below shows average comments/week vs average discussions/week for all 30 groups, with a circle size proportional to group size and circle color corresponding to activity change – green meaning increase, red decrease. We also show median lines for each dimension, which can be used to divide the groups in 4 quadrants.

Fig 4: 4 Quadrants of Top Linked Analytics, Big Data, Data Science Groups: Commenting vs Posting

Several groups stand out: KDnuggets has the highest number of discussions per 1000 members, while RDM has the highest number of comments. The median lines divide the groups into 4 quadrants, which we can characterize as

  • “Engaged” (above median on both comments and discussions): KDnuggets, Dscientists, PAN, DM Stat, RDM, Big Data & A, Adv BADM
  • “Posting” (above median on discussions, below median on comments): DSC, Global A
  • “Commenting” (below median on discussions, above median on comments): NGMR, RMA/RMDS, PR
  • “Passive” (below median on both comments and discussions)
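The quadrant assignment can be sketched from the comment and discussion rates in the table of groups in this post. The function name is hypothetical, and treating a value exactly at the median as "below" is an assumption; with these numbers no group sits exactly on either median.

```python
from statistics import median

# abbreviation: (comments/week, discussions/week), both per 1K members,
# copied from the table in this post.
RATES = {
    "Adv BADM": (1.86, 1.69), "Big Data ASFSSP": (0.76, 1.54),
    "Big Data & A": (1.97, 1.86), "Biz Analytics": (0.71, 0.62),
    "DM Stat": (1.92, 3.24), "BD Prof": (0.84, 3.31),
    "NGMR": (2.13, 1.28), "SAS A&BI": (0.48, 0.50),
    "BI&A": (0.21, 1.48), "Global A": (0.22, 1.91),
    "ML Conn": (0.78, 0.53), "PR": (1.06, 0.19),
    "SAS Users": (0.22, 1.20), "Actuary": (0.17, 0.25),
    "RMDS": (1.05, 1.29), "Text A": (0.76, 1.04),
    "DSC": (0.64, 3.45), "Adv A": (0.98, 2.51),
    "Visual": (0.81, 1.80), "D&TA Prof": (0.32, 1.35),
    "PAN": (0.90, 5.79), "Adv AP": (1.06, 0.12),
    "Dscientists": (2.18, 6.72), "Lavastorm": (0.08, 0.66),
    "RDM": (2.88, 2.63), "KDnuggets": (2.57, 9.68),
    "DMT": (0.19, 1.43), "PMML": (0.32, 0.23),
    "Healthcare": (0.63, 0.75), "BI Tools": (0.25, 1.81),
}


def quadrants(rates):
    """Label each group by comparing its rates with the two medians."""
    cmt_median = median(cmt for cmt, _ in rates.values())
    disc_median = median(disc for _, disc in rates.values())
    labels = {}
    for name, (cmt, disc) in rates.items():
        if cmt > cmt_median and disc > disc_median:
            labels[name] = "Engaged"
        elif disc > disc_median:
            labels[name] = "Posting"
        elif cmt > cmt_median:
            labels[name] = "Commenting"
        else:
            labels[name] = "Passive"
    return labels


labels = quadrants(RATES)
for quadrant in ("Engaged", "Posting", "Commenting", "Passive"):
    members = [name for name, label in labels.items() if label == quadrant]
    print(f"{quadrant}: {', '.join(members)}")
```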

The details are in the table below, with groups ordered by the number of members. The link to the raw data is at the end of the post.

The growth, comments, and discussions values are in green font if the value is 25% above average, in red if 25% below average, and in black otherwise. We note that there are only 4 “triple green” groups, which are significantly above average on growth, comments, and discussions:

  • Data Mining, Statistics, Big Data, and Data Visualization
  • Data Scientists
  • RDataMining
  • KDnuggets Analytics, Data Mining, and Data Science
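The coloring rule can be sketched as follows. It assumes "25% above average" means at least 1.25 times the average (and "25% below" at most 0.75 times), which is an interpretation rather than something the post states. The averages come from the "Average" row of the table, and only a few sample rows are included.

```python
# "Average" row of the table: growth in %, comments and discussions
# per week per 1K members.
AVERAGE = {"growth": 53, "comments": 0.96, "discussions": 2.03}

# A few sample rows copied from the table.
GROUPS = {
    "Data Scientists": {"growth": 114, "comments": 2.18, "discussions": 6.72},
    "RDataMining": {"growth": 126, "comments": 2.88, "discussions": 2.63},
    "Big Data and Analytics": {"growth": 100, "comments": 1.97, "discussions": 1.86},
    "Business Analytics": {"growth": 43, "comments": 0.71, "discussions": 0.62},
}


def color(value, average, margin=0.25):
    """Green if at least 25% above average, red if at least 25% below."""
    if value >= average * (1 + margin):
        return "green"
    if value <= average * (1 - margin):
        return "red"
    return "black"


def is_triple_green(row):
    """True when growth, comments, and discussions are all green."""
    return all(color(row[key], AVERAGE[key]) == "green" for key in AVERAGE)


for name, row in GROUPS.items():
    colors = {key: color(row[key], AVERAGE[key]) for key in AVERAGE}
    print(name, colors, "triple green" if is_triple_green(row) else "")
```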
Abbreviation | LinkedIn Group | Members (Mar 31, 2014) | Founded | 12-month growth (annualized) | Comments/week per 1K members | Discussions/week per 1K members
Average | | 23058 | 22-Dec-08 | 53% | 0.96 | 2.03
Adv BADM | Advanced Business Analytics, Data Mining and Predictive Modeling | 121816 | 28-Sep-07 | 74% | 1.86 | 1.69
Big Data ASFSSP | Big Data / Analytics / Strategy / FP&A / S&OP / Strategic Planning / Predictive & Business Analytics Group | 95638 | 20-Feb-09 | 82% | 0.76 | 1.54
Big Data & A | Big Data and Analytics | 74350 | 1-Mar-12 | 100% | 1.97 | 1.86
Biz Analytics | Business Analytics | 53345 | 3-Mar-08 | 43% | 0.71 | 0.62
DM Stat | Data Mining, Statistics, Big Data, and Data Visualization | 43761 | 25-Jul-08 | 116% | 1.92 | 3.24
BD Prof | BIG DATA Professionals – Architects Scientists Analytics Experts | 30792 | 1-Sep-08 | 92% | 0.84 | 3.31
NGMR | Next Gen Market Research (NGMR) | 23368 | 26-Sep-07 | 15% | 2.13 | 1.28
SAS A&BI | SAS Analytics & BI (closed) | 20941 | 25-Jun-08 | 32% | 0.48 | 0.50
BI&A | Business Intelligence & Analytics Group | 20000 | 6-Jan-08 | 4% | 0.21 | 1.48
Global A | Global Analytics Network | 19389 | 23-May-08 | 11% | 0.22 | 1.91
ML Conn | Machine Learning Connection (closed) | 19087 | 12-Mar-08 | 52% | 0.78 | 0.53
PR | Pattern Recognition, Data Mining, Machine Intelligence (closed) | 16297 | 2-Oct-08 | 60% | 1.06 | 0.19
SAS Users | SAS & Analytics Users | 15121 | 13-Apr-08 | 34% | 0.22 | 1.20
Actuary | Actuary / Actuarial, Predictive Modeling, Data Mining, and Statistics News / Jobs / Careers Group | 14930 | 24-Sep-08 | 28% | 0.17 | 0.25
RMDS | Research Methods and Data Science (former RMA) | 14929 | 10-Apr-09 | 35% | 1.05 | 1.29
Text A | Text Analytics | 13947 | 2-Jun-08 | 27% | 0.76 | 1.04
DSC | Data Science Central | 13112 | 10-Feb-12 | 48% | 0.64 | 3.45
Adv A | Advanced Analytics (closed) | 12025 | 11-Jan-09 | 66% | 0.98 | 2.51
Visual | Visual Analytics | 8450 | 31-Mar-08 | 50% | 0.81 | 1.80
D&TA Prof | Data & Text Analytics Professionals | 8003 | 24-Sep-07 | 22% | 0.32 | 1.35
PAN | Predictive Analytics Network (PAN) | 7623 | 16-Mar-09 | 72% | 0.90 | 5.79
Adv AP | Advanced Analytics, Predictive Modeling & Statistical Analyses (closed) | 7278 | 10-Jul-08 | 41% | 1.06 | 0.12
Dscientists | Data Scientists | 7052 | 8-Jun-09 | 114% | 2.18 | 6.72
Lavastorm | Lavastorm Analytics Community Group | 6349 | 17-Apr-11 | 22% | 0.08 | 0.66
RDM | RDataMining | 4972 | 30-Aug-11 | 126% | 2.88 | 2.63
KDnuggets | KDnuggets Analytics, Data Mining, and Data Science | 4886 | 4-Feb-08 | 73% | 2.57 | 9.68
DMT | Data Mining Technology (closed) | 4034 | 20-Jun-08 | 33% | 0.19 | 1.43
PMML | Predictive Model Markup Language (PMML) | 3596 | 24-Sep-09 | 18% | 0.32 | 0.23
Healthcare | Healthcare Data Mining and Modeling | 3594 | 11-Jul-08 | 63% | 0.63 | 0.75
BI Tools | Business Intelligence Tools | 3044 | 2-Jul-08 | 42% | 0.25 | 1.81

Note: You can get the actual data from the HTML source code of the LinkedIn group Statistics/Activity page.

Look for dataset seriesName="Comments" and parse that data. Do the same for Discussions and Members.
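A rough sketch of that parsing step is below. The exact markup of the statistics page is not shown in this post, so the sample snippet, the `<set value="..."/>` child elements, and the function name are all assumptions made for illustration. Only the `dataset` element with a `seriesName` attribute comes from the note above.

```python
import re

# Hypothetical snippet standing in for the page source. The real page
# may lay out its chart data differently.
SAMPLE = """
<chart>
  <dataset seriesName="Comments"><set value="12"/><set value="7"/><set value="15"/></dataset>
  <dataset seriesName="Discussions"><set value="30"/><set value="22"/><set value="41"/></dataset>
</chart>
"""


def series_values(page_source, series_name):
    """Return the numeric values of the first dataset with the given seriesName."""
    pattern = (
        r'<dataset\b[^>]*\bseriesName="' + re.escape(series_name) +
        r'"[^>]*>(.*?)</dataset>'
    )
    match = re.search(pattern, page_source, flags=re.DOTALL | re.IGNORECASE)
    if match is None:
        return []
    # Assumed layout: one value="..." attribute per data point.
    return [float(v) for v in re.findall(r'\bvalue="([^"]*)"', match.group(1))]


comments = series_values(SAMPLE, "Comments")
discussions = series_values(SAMPLE, "Discussions")
print("mean comments per period:", sum(comments) / len(comments))
print("mean discussions per period:", sum(discussions) / len(discussions))
```

A regular expression is enough for a quick extraction like this. For anything less regular, a proper HTML or XML parser would be a safer choice.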

Thanks to Anmol Rajpurohit for collecting the membership, comments, and discussions data.

Here is raw data (csv) for the top 30 LinkedIn groups.

Let me know which relevant groups were missed and what other trends you see.

By Gregory Piatetsky, KDnuggets
Originally published at www.kdnuggets.com
