Posts Tagged ‘Hadoop’

Presenting at the Raleigh-Durham No Fluff Just Stuff Symposium

August 29th, 2010

North Carolina

This week, I made a four day journey to the very forested state of North Carolina. Joey knew a Coloradoan was coming and turned on the statewide AC to bring it down to a comfortable 72 degrees Fahrenheit when I landed. The food was great, the people were super, and the technology was awesome.

Relevance

I had an open invitation to come out and visit the team at Relevance, which I’d been waiting to cash in. The Research Triangle NFJS Symposium finally made that visit possible.

I had a great time meeting the entire Relevance team, working with Stu Halloway on automating the Clojure release scripts through some Bash scripting, Git calls and Maven Ant Tasks.

At lunch, I gave a live demo of my workflow with the DevonThink Pro product, including capturing and aggregating multiple RSS streams alongside archived emails and snippets from web pages.

In the afternoon, I had the fun assignment of working with Aaron Bedra on an implementation of JCE symmetric AES encryption on a Clojure project. He followed up a day later on “Relevance Open Source Friday” by beginning to move the implementations over to a standard library for upcoming public consumption.

No Fluff, Just Stuff

I had the pleasure of presenting Encryption, Open Source Debugging on the JVM (over 50 deliveries of that one now), Hadoop, Maven 3 and Git to the engaged audience in Raleigh on Friday and Saturday. The during-presentation questions were spot on, and even when the topics got heady, the students just leaned forward in their chairs and kept on making insightful inquiries. Attendees of that nature are pure candy to a passionate presenter like myself. I especially want to thank Darin Pope, Billy Dupre, David Bloom, David Deininger, Sri Sankaran, Asif Rashid, and Ed Savage for providing much-desired feedback on Twitter.

I know I’ll be back for the NFJS show next year, but I’ll make all efforts to put in a few more visits prior to that. A city with a technology culture of this strength demands that I do. Thanks for having me and perhaps I’ll see some of you at the Rich Web Experience in December on the beaches of sunny Florida.

Resources

For the folks that attended my talks this weekend, here are some constantly updated supplemental materials to resources that are paired with the slides:

Encryption

Open Source Debugging

Hadoop

Maven 3

Git

Denver JUG Hadoop and Encryption Presentations

January 15th, 2010

Denver JUG January Meeting

I had the pleasure of hanging out with about 60 of my local friends at the Denver Java Users Group (DJUG to the locals) on Wednesday night and talking about Encryption on the JVM as well as Hadoop. I had the good fortune of having Andy Sautins of Returnpath.net, who’s an active user of Hadoop, field a few of the questions. I really appreciate the time a few of the folks spent giving me feedback on Speakerrate.com. For your future reference, below are the slides and sample source. Feedback and suggestions are always welcome at matthewm@ambientideas.com

Encryption Bootcamp on the JVM

Abstract

Does your application transmit customer information? Are there fields of sensitive customer data stored in your DB? Can your application be used on insecure networks? If so, you need a working knowledge of encryption and how to leverage Open Source APIs and libraries to make securing your data as easy as possible. Encryption is quickly becoming a developer’s new frontier of responsibility in many data-centric applications.

In today’s data-sensitive and news-sensationalizing world, don’t become the next headline by an inadvertent release of private customer or company data. Secure your persisted, transmitted and in-memory data and learn the terminology you’ll need to navigate the ecosystem of symmetric and public/private key encryption.

Intro to Hadoop

Abstract

Moore’s law has finally hit the wall and CPU speeds have actually decreased in the last few years. The industry is reacting with hardware with an ever-growing number of cores and software that can leverage “grids” of distributed, often commodity, computing resources. But how is a traditional Java developer supposed to easily take advantage of this revolution? The answer is the Apache Hadoop family of projects. Hadoop is a suite of Open Source APIs at the forefront of this grid computing revolution and is considered the absolute gold standard for the divide-and-conquer model of distributed problem crunching. The well-travelled Apache Hadoop framework is currently being leveraged in production by prominent names such as Yahoo, IBM, Amazon, Adobe, AOL, Facebook and Hulu just to name a few.

In this session, you’ll start by learning the vocabulary unique to the distributed computing space. Next, we’ll discover how to shape a problem and processing to fit the Hadoop MapReduce framework. We’ll then examine the incredible auto-replicating, redundant and self-healing HDFS filesystem. Finally, we’ll fire up several Hadoop nodes and watch our calculation process get devoured live by our Hadoop grid. At this talk’s conclusion, you’ll feel equipped to take on any massive data set and processing your employer can throw at you with absolute ease.