some interesting remarks on Big Data, Hadoop, ML, Deep Learning, etc by Michael Stonebraker

Notice: Page may contain affiliate links for which we may earn a small commission through services like Amazon Affiliates or Skimlinks.

BLinux

cat lover server enthusiast
Jul 7, 2016
2,672
1,081
113
artofserver.com
If you've been involved with anything related to the buzzwords in the title of this post, Stonebraker's remarks in this video are interesting:

 
  • Like
Reactions: Patrick

BLinux

cat lover server enthusiast
Jul 7, 2016
2,672
1,081
113
artofserver.com
his points resonate with my experiences dealing with big data analytics and machine learning efforts at all the corporations i've worked with in the last 8 years or so. so much time is spent on figuring out how to transport the data (large, large pipes) and/or massage the data (cleaning/sanitizing, enriching, categorizing, etc.) before it ever gets to the actual analytics.

it also resonates with me on the issue of getting good training data sets for learning algorithms. just as school curriculum and textbooks are carefully selected and curated, you have to do the same with things like ANNs.