Random thoughts on a Friday afternoon… We’ve all got problems. More to the point, every IT department or team has problems of some kind. It’s why we hire consultants, buy products, start long and arduous journeys into the great unknown depths of root cause analysis, and so on. What fascinates
The Delphix Alchemist?
Alchemists are best known for their (completely fictional and entirely ridiculous, but that’s besides the point) amazing ability to turn lead into gold. Let’s face it, there’s a lot of lead in the Oracle world. Bugs, angry developers, metrics that can seem to elude human understanding…but I digress. The question
Some Oracle Community News
With all the posts on Hadoop lately I bet you forgot I am an Oracle geek too, huh? Well, if the name OracleAlchemist wasn’t enough to remind you, I have some news that might. Two big things have happened this week that I’d like to share. OracleCommunity.net Eddie Awad–the industrious
The High Price of Data
You’ve purchased servers, storage space, switches, cables, and countless other pieces of hardware. The Oracle licenses are bought and paid for, Enterprise Edition with a few add-ons. All told, you’ve spent a small fortune on this infrastructure. It’s finally time to start up your database and begin using it for
Hadoop Streaming, Hue, Oozie Workflows, and Hive
MapReduce with Hadoop Streaming in bash – Bonus! To conclude my three part series on writing MapReduce jobs with shell script for use with Hadoop Streaming, I’ve decided to throw together a video tutorial on running the jobs we’ve created in Oozie, a workflow editor for Hadoop that allows jobs
MapReduce with Hadoop Streaming in bash – Part 3
In our first MapReduce with Hadoop Streaming in bash article, we took a collection of Stephen Crane poems and used a MapReduce job to calculate ‘term frequency’–meaning we counted the number of times each word in the collection appeared in the collection. In the second part, we calculated ‘document frequency’
MapReduce with Hadoop Streaming in bash – Part 2
In MapReduce with Hadoop Streaming in bash – Part 1 we found the ‘term frequency’ of words within a collection of documents. For the documents I chose 8 Stephen Crane poems, and our bash Map and Reduce jobs tokenized the words and found their frequency among the entire set. The
MapReduce with Hadoop Streaming in bash – Part 1
So to commemorate my recent certification and because my Java absolutely sucks, I decided to do a common algorithm using Hadoop Streaming. Hadoop Streaming Hadoop Streaming allows you to write MapReduce code in any language that can process stdin and stdout. This includes Python, PHP, Ruby, Perl, bash, node.js, and
Cloudera Certified Developer for Hadoop (CCDH)
Taking the Cloudera Developer Training for Apache Hadoop had many rewards — one of which was a free voucher to take the CCD-410 Exam (normally $295) which you must pass to get CCDH certified. I’m not sure if that’s a Cloudera University or Global Knowledge thing, but either way it
How I Became a DBA
The DBA job title is an interesting one. While everyone can understand “programmer” or “software tester” or even “system administrator”, the DBA role is so misunderstood by both muggles (I’m going to hell for using that word) and colleagues alike. In fact, at this point I’m fairly certain that it