2018 First Annual theColi Analytics Awards

Spiritual Stratocaster

Jesus is KING
Supporter
Joined
Aug 14, 2014
Messages
38,606
Reputation
7,273
Daps
149,547

MikeOck

Banned
Joined
Apr 25, 2018
Messages
1,112
Reputation
-605
Daps
3,907
If you're on any of those lists, you need to step back from this site for a bit.
 

newworldafro

DeeperThanRapBiggerThanHH
Joined
May 3, 2012
Messages
50,014
Reputation
4,804
Daps
112,631
Reppin
In the Silver Lining
Created a spider (scrapy in python) that crawled the Coli posts and extracted user data. Then I just analyzed (BeautifulSoup in Python) organized and posted.

Is that Python's main purpose, for data analytics? I want to learn how to use it in ArcGIS.

Its data. You can spin it how you want but if these guys were all spamming, they wouldn't make it to this amount of posts. That said, Nap is a team of agents in Langley, VA



CkLvq.gif


I really need to start posting more. :francis::ld:

Nah quality over quantity.
Organic over artificial posts.
 
Last edited:

newworldafro

DeeperThanRapBiggerThanHH
Joined
May 3, 2012
Messages
50,014
Reputation
4,804
Daps
112,631
Reppin
In the Silver Lining
@DrBanneker I :clap: :salute: :cheers: for this compilation of big data.

Can you replicate this thread for the main subforums at least, like the following:
  • The Coliseum
  • The Booth
  • The Locker Room
  • The Arcadium
  • The Film Room
  • TSC
  • Just Buggin Out
  • Higher Learning
^^^^^Is that to much work or doable, since you have the software it appears doing the number crunching for you? I think know the posters would love that shiiit.

Release the data in those 8 subforums on the same day :wow:.......

you should scrape the key words for top threads made :sas2:

Good one. :ehh:

If you scrape for most current event predictions, proven correct, I gotta be #1.. :sas1:
 
Last edited:

DrBanneker

Space is the Place
Joined
Jan 23, 2016
Messages
5,576
Reputation
4,516
Daps
19,150
Reppin
Figthing borg at Wolf 359
Is that Python's main purpose, for data analytics? I want to learn how to use it in ArcGIS.

Python is a general use script/programming language but it has a huge library of scientific/data science type modules available for free so it is extremely useful in that regard. The links below will probably interest you. Python is very easy (easier than C++, Java, etc.) so you could probably pick up the basics in a couple of weeks and start doing stuff pretty quickly. PM or start a thread in 6 Figures 6 Certs if you want details

GIS in Python — Data Analysis in Python 0.1 documentation
What is ArcPy?—Help | ArcGIS Desktop
 

DrBanneker

Space is the Place
Joined
Jan 23, 2016
Messages
5,576
Reputation
4,516
Daps
19,150
Reppin
Figthing borg at Wolf 359
@DrBanneker I :clap: :salute: :cheers: for this compilation of big data.

Can you replicate this thread for the main subforums at least, like the following:
  • The Coliseum
  • The Booth
  • The Locker Room
  • The Arcadium
  • The Film Room
  • TSC
  • Just Buggin Out
  • Higher Learning
^^^^^Is that to much work or doable, since you have the software it appears doing the number crunching for you? I think know the posters would love that shiiit.

Release the data in those 8 subforums on the same day :wow:.......



Good one. :ehh:

If you scrape for most current event predictions, proven correct, I gotta be #1.. :sas1:

I am working on this and the script is about 90% done though there are some bugs and other issues with processing this much data. I hope to get it pretty soon though.
 

Maude

Superstar
Supporter
Joined
Mar 4, 2014
Messages
4,582
Reputation
3,190
Daps
19,555
Reppin
The Midwest
Created a spider (scrapy in python) that crawled the Coli posts and extracted user data. Then I just analyzed (BeautifulSoup in Python) organized and posted.
rep, never heard of scrapy. did you use pandas to analyze the data?
 

invalid

Banned
Joined
Feb 21, 2015
Messages
19,972
Reputation
6,797
Daps
80,752
@DrBanneker Have you started making year over year comparisons?
For instance, it would be interesting to see the most used smileys every year over the past five years.
And look into why these smiley expressions were the most popular.
And what it may say about our own psychology.

Other things that could be looked into -

Top 10 posters by post count - historical look at their post counts for the past 5 years.
And look more into the reasons for an up/down in post count - unemployment, increase in productivity, etc.

Top 10 posters by dap/to post ratio - historical look at their dap/to post over the past 5 years.
Look into consistency or surge/drop.
Causes for surge/drop.

Top 10 posters by reputation - historical look at their reps over the past 5 years.
What type of postings caused coli posters to respond in a way that was positive enough to want to pos rep them.

Top 10 posters with the worst reps - historical look at their reps over the past 5 years.
What type of postings caused coli posters to respond in a way that was negative enough to want to neg rep them.

Types of post that received the most daps.
Types of post that received the most pos reps.
Types of post that received the most negs.
Types of threads that received the most responses.
Which months receives the most post per day. 5 year historical look.
Which seasons received the most post out of the year. 5 year historical look.
Types of threads that get bushed.
Types of threads that go to the raftors.

Literally could do a blueprint of the types of threads that garners the most response from the board, what types of post garners the most pos reps, etc. for those that are looking to increase their popularity on the site.

And then do a pyscho analysis of all the findings - what this says about the coli, what we find important/not important, etc.
 

Tr0yTV

All Star
Joined
Mar 20, 2017
Messages
1,235
Reputation
916
Daps
5,136
Can you post the script? I want to review it for research purposes :lolbron:
 

DrBanneker

Space is the Place
Joined
Jan 23, 2016
Messages
5,576
Reputation
4,516
Daps
19,150
Reppin
Figthing borg at Wolf 359
@DrBanneker Have you started making year over year comparisons?
For instance, it would be interesting to see the most used smileys every year over the past five years.
And look into why these smiley expressions were the most popular.
And what it may say about our own psychology.

Other things that could be looked into -

Top 10 posters by post count - historical look at their post counts for the past 5 years.
And look more into the reasons for an up/down in post count - unemployment, increase in productivity, etc.

Top 10 posters by dap/to post ratio - historical look at their dap/to post over the past 5 years.
Look into consistency or surge/drop.
Causes for surge/drop.

Top 10 posters by reputation - historical look at their reps over the past 5 years.
What type of postings caused coli posters to respond in a way that was positive enough to want to pos rep them.

Top 10 posters with the worst reps - historical look at their reps over the past 5 years.
What type of postings caused coli posters to respond in a way that was negative enough to want to neg rep them.

Types of post that received the most daps.
Types of post that received the most pos reps.
Types of post that received the most negs.
Types of threads that received the most responses.
Which months receives the most post per day. 5 year historical look.
Which seasons received the most post out of the year. 5 year historical look.
Types of threads that get bushed.
Types of threads that go to the raftors.

Literally could do a blueprint of the types of threads that garners the most response from the board, what types of post garners the most pos reps, etc. for those that are looking to increase their popularity on the site.

And then do a pyscho analysis of all the findings - what this says about the coli, what we find important/not important, etc.

I think I can do a lot on the post stats and daps/post by user. The issue with reputation over time is there is no public record of who gives rep or when people lose it. Their rep always reflects their current status so unless I can get cached versions of the site or webmaster tech support, I am pretty stuck on rep.
 
Top