Big Data/BI Zone is brought to you in partnership with:
  • submit to reddit
Arthur Charpentier03/25/13
863 views
0 replies

Comparing Quantiles for Two Samples

Recently, for a research paper, I had some samples and I wanted to compare them. Not to compare the means (by construction, all of them were centered) but the dispersion. And not their variance, but more their quantiles. Consider the following boxplot type function, where everything is quantile related.

John Cook03/24/13
1758 views
0 replies

Appreciation for Plain Text Files

The development of my attitude toward plain text files over time, graphed.

Daniel Bartl03/24/13
504 views
0 replies

Ubercharts Live Charts with Wicket 6 and Websockets

Today we want you to show how you can provide live tracking charts for Web-based dashboards. This small showcase Application is based on Wicket 6, Wicket Websockets and Ubercharts.

Eric Gregory03/23/13
2876 views
1 replies

Large-Scale Data Processing with MapReduce and PHP

This PHPDay talk from David Zuelke explores data processing with PHP and MapReduce.

John Cook03/23/13
2232 views
0 replies

Nonnerdy Applied Mathematics

Applied mathematics as pragmatic problem-solving methodology.

Troy Hunt03/22/13
2917 views
0 replies

Are We Ready to Bank via Facebook?

Banking, you say? In your Facebook, you say? What could possibly go wrong?

Eric Gregory03/22/13
2089 views
0 replies

BigQuery Gets 'Big JOIN' and More New Features

Google recently announced some major new features for its BigQuery analytics tool, including SQL-esque join and aggregate functionality, native TIMESTAMP support, and an expanded web UI.

Tharindu Mathew03/22/13
284 views
0 replies

Solving the NameNotFoundException When Connecting to IBM MQ through JMS

If you are facing an exception like the one at [0], then the problem might be hard to figure out. Because, if you look at the Queues in the MQ explorer, the queu named FOOQ will be there.

Daniel Korzekwa03/22/13
641 views
0 replies

Tennis, Scala, and Expectation Propagation Bayesian Inference

Here's a tutorial on modelling skills of tennis players with TrueSkill rating model in Scala.

Christopher Taylor03/21/13
1445 views
0 replies

Data and Connectedness Create a Reputation Wild West

Welcome to 24 x 7 x 365 connectedness and the challenges that come with an always-on world. That world is generating data, loads of data. And the more we’re ‘on’, the more data, and the more reason to be ‘on’. It’s a never ending cycle of chasing our data tail.

Arthur Charpentier03/21/13
1901 views
0 replies

Surveillance States, How Open Source Threatens SAS, and More Data Links

This week, Arthur Charpentier brings us the Internet as surveillance state, how Mahout and R threaten SAS, the social structure of news, and much more.

John Cook03/21/13
1737 views
0 replies

Preparing for Google Reader Going Away

Most of you who subscribe to this blog use Google Reader or use an RSS reader that depends on Google’s Feedfetcher. Here’s a snapshot from before Google announced the end of Reader.

Jerry Nixon03/21/13
411 views
0 replies

DevRadio: UX Guidance for Handling Large Data in Windows 8 Apps

In today’s episode, Tyler showcases the Netflix and Newegg apps for Windows 8 as good examples of apps that make it easier for users to find and process information.

Damien Lepage03/20/13
2559 views
0 replies

If Java was a Haskell - The Type System

In my previous article If Java was a Haskell I tried to explain the pure functional paradigm offered by Haskell through the words of a Java developer. In this post I will focus on the other strength of Haskell, a strong type system.

John Cook03/20/13
296 views
0 replies

An Incomplete Post About Sphere Volumes

This is an incomplete blog post. Maybe you can help finish it. One of the formulas I’ve looked up the most is the volume of a ball in n dimensions. I needed it often enough to be aware of it, but not often enough to remember it.

Simon Martinelli03/20/13
584 views
0 replies

New Open Source Project SQL Result Mapper

In JPA there is no Constructor Expression for native SQL queries. SQL Result Mapper fills the gap! And because the implementation was quite easy there is an implentation for JDBC as well.

Ravi Kalakota03/20/13
454 views
0 replies

Data-as-a-Service (DaaS)

DaaS strategies have increased dramatically in the last few years with the maturation of technologies such as data virtualization, data integration, SOA, BPM and Platform-as-a-service.

Chris Keene03/19/13
4138 views
0 replies

Hadoop Will Not Mow Your Lawn

It turns out that when you have a lot of "best minds" working on the same problem, you come up with some pretty interesting technology - no matter how inane that problem may be.

Eric Gregory03/19/13
2269 views
0 replies

Links You Don't Want To Miss (3/19)

Today: Etsy's methodical battle-plan for rooting out the Java browser plugin from its systems, the downside of shiny new technologies, and the burgeoning market for cat apps.

Jason Whaley03/19/13
792 views
0 replies

On Schneier's Survelliance State

A brain dump of thoughts and rebuttals that immediately sped through my mind as I read Bruce Schneier's latest piece, "The Internet is a Surveillance State."

John Berryman03/19/13
875 views
0 replies

Solr Unleashed: Mission Accomplished

This past Wednesday and Thursday (March 13th and 14th) OpenSource Connections held an on-site 2-day Solr training course called Solr Unleashed.

Mikio Braun03/18/13
4039 views
2 replies

Talking About Google Reader

As you’ve probably heard by now, Google is shutting down Google Reader on July 1, 2013. Reactions are mixed.

John Berryman03/18/13
1638 views
0 replies

Getting Started Quickly with Hadoop and MapReduce

So here’s the problem: You’ve finally found a block of time to set down and get your head around Hadoop and MapReduce.

Daniel Bartl03/18/13
830 views
0 replies

Big Data Analyses with R Training

Everyone wants to deal with Big Data and the new hype topic is on every CIO’s mind. There are currently many different definitions available to cover as many big data topics as possible.

Rafał Kuć03/18/13
432 views
0 replies

Win Free Copies of Packt’s New Book on Apache Solr

Readers would be pleased to know that Solr has teamed up with Packt Publishing to organize a Giveaway of the Apache Solr 4 Cookbook.