Tuesday, February 25, 2014

INFORGRAPHIC: The Journey of WhatsApp

It is still amazing to note that WhatsApp, a 5-year-old instant messaging platform is worth a whooping sum of $19 billion dollars. 

WhatsApp has over 450 Million active users; this is the first time in history that an instant messaging social media platform will have such a massive user-base within a short time.


Statistics have shown that WhatsApp’s journey has been smooth and rewarding. Woodpeck3r just designed an Info-graphic called ‘Journey of WhatsApp’ and you will be much interested to see this below:


Photo Credit: Woodpeck3r

#OpenData: About 88% Of Nigerians Are Gearing Up To Vote In The 2015 Election - NOIPolls

By NOIPolls

Photo Credit: BBC News 

The recently released Election Poll results by NOIPolls Limited has revealed that majority of adult Nigerians claim to be registered voters (75%). Also, of the 25% yet-to-register, the vast majority (92%) plan to register in time for the 2015 elections. The results further revealed that majority of registered voters (88%) are eagerly looking forward to voting in the 2015 elections; while citing the need “to exercise their right to vote as Nigerians” (40%) and “to vote for the right/good leader” (31%) as the top reasons. On the other hand, of the 12% of registered voters who do not look forward to voting, the most cited reasons include the suspicion that “their vote will not count” (36%), they “Don’t have the time” (22%), and they “Don’t Trust the Elections” (21%). These are the key findings from the Countdown to the 2015 Election Poll conducted in the week of February 17th 2014.

Brief Background
In January 2014, Mrs. Augusta Ogakwu; the Secretary of the Independent National Electoral Commission (INEC) announced that the Presidential and National Assembly elections will hold on February 14, 2015; while the State Assembly and Governorship elections are scheduled for February 28, 2015. Mixed reactions have trailed the election timetable; some Nigerians applauded INEC for the timely release of the schedule for the elections while others have questioned the decision to hold the presidential and National Assembly polls before the governorship and state assembly elections. They argued that holding the presidential poll first would have a bandwagon effect on the subsequent elections
[1].

As Nigeria prepares towards the 2015 polls, INEC Chairman; Attahiru Jega has painted an optimistic picture of the 2015 elections saying the lessons of the 2011 elections had been learnt and new processes are now in place to make the coming elections “much better than anything in the past.” Questions that are currently swirling around and being debated in the polity include” Has the PDP been weakened irrevocably? Can the APC hold itself together? Will Jonathan win if he runs again?”
[2]

Against this background, NOIPolls conducted its Countdown to the 2015 Elections poll to seek the views of Nigerians regarding the 2015 general elections. This is the second in the series of election polls conducted by NOIPolls (the first one was conducted in April 2013) and the first in the series of bi-monthly polls that will be conducted as Nigeria countdowns to the actual elections in 2015.       

Respondents to the poll were asked five specific questions. In order to gauge the proportion of registered voters, respondents were asked: Presently, are you a registered voter? Overall, the majority (75%) responded positively, claiming they are currently registered voters; while25% of Nigerians responded negatively.

Further analysis by geo-political zones, indicates that the South-East zone (83%) has the highest proportion of registered voters, while the North-West zone and the South-South zone (both 28%) have the highest proportion of those that are not yet registered. In addition, the age-group that has the highest percentage (89%) of registered voter is 46-60 years.

 
[1] The Sun Newspapers
[2] Premium Times


 
Respondents who are not registered (25% of the total) were further asked: If No, do you plan to register in time for the 2015 elections? Responses to this question revealed that the overwhelming majority of those not registered; 92% plan to register in time for the 2015elections, while only 8% are not interested in registering at all.

When current findings are compared with results from the previous poll conducted in 2013, there is a significant 16-point increase in the proportion of Nigerians who are yet to register, but indicate they have plans to register in time for the 2015 elections.


The third question sought to gauge the level of enthusiasm among registered voters. Registered respondents (75% of the total) were asked: Are you looking forward to voting in the 2015 elections? Overall, the majority of registered voters (88%) indicated they are looking forward to voting in the 2015 elections; while 12% of the respondents claim they are not looking forward to voting in 2015 elections.

Findings based on geo-political zones show that the South-East zone (93%) had the highest proportion of respondents who are looking forward to voting in the 2015 elections. This is followed by the North-Central zone with 92% and the North-West zone with 89%



Furthermore, when these findings are compared with the previous poll conducted in 2013, there was a slight 2-point decline in the proportion of Nigerians who look forward to voting in the 2015 elections. 


Subsequently, in order to gain insights into factors stimulating the keenness to vote, Nigerians who are looking forward to voting in the 2015 elections (88% of registered voters) were further asked: If yes, why are you looking forward to voting in the 2015 elections? Bearing in mind that this was an open-ended question, the findings revealed that the majority (40%) are looking forward to voting in the 2015 elections because they want “toexercise their right to vote as Nigerians”, while 31% are stimulated by the need “to vote for the right/good leader”. Furthermore, 10% are looking forward to the elections in order “to vote for a change of government”, while another 7% mentioned “for transformation / better Nigeria.

 Analysis based on geo-political zones revealed that the South-West zone (47%) has the highest proportion of Nigerians who are stimulated by the need “to exercise their right to vote as Nigerians”, while the South-East zone (41%) accounts for the largest proportion of respondents that look forward to 2015 elections in order “to vote for the right/good leader”. Also, the South-South zone (30%) has the highest proportion of respondents that indicated “to vote for a change of government”.


Furthermore, trend analysis of the findings obtained from the previous poll revealed a significant 10-point decline in the proportion of respondents who are motivated by the need to vote for the right/good leader; it was 41% in 2013. Furthermore, there was a slim 2-pointincrease in the proportion of Nigerians that want to vote in order to exercise their right to vote as Nigerians in 2014.


Finally, in similar way respondents who indicated they are not looking forward to voting in the 2015 elections (12% of registered voters) were asked: If no, why are you not looking forward to voting in the 2015 elections?  Results indicate the majority (36%) are of the opinion that “their vote does not count”, this is followed by 22% who say “they don’t have time” and 21% who say they have “no trust in elections”. In addition, 14% claim there are demotivated towards voting in the 2015 elections  due to “insecurity” and 5% are of the opinion that the “Nigerian system is bad”.
 
Gauging the responses from the geo-political zone standpoint reveals that the North-Eastzone has the highest (87%) proportion of respondents who think “their vote does not count” while the North-West zone has the highest proportion of respondents who claim “they don’t have time” (45%)  and  also indicated “insecurity” (45%) as a demotivating factor to voting. In addition, the South-West zone (41%) accounts for the highest proportion of respondents who have “no trust in elections”.


When these current findings are compared with the results obtained in 2013, there was an 8-point decline in the proportion of respondents that indicate their “vote does not count” even though it remains the popular opinion among respondents that are not looking forward to voting.

 
In conclusion, findings from the election poll reveals the majority of Nigerians are registered voters (75%) and of this proportion, the overwhelming majority; 88% are looking forward to voting in the 2015 general elections. More findings also reveals that the overwhelming majority;92% out of 25% who are not yet registered plan to register in time for the 2015 elections. Furthermore, in 2014 there was a significant 16-point increase in the proportion of Nigerians who are yet to register but indicated they plan to register in time for the elections. In addition, the vast majority of registered voters are looking forward to voting in the 2015 elections (88%). The most popular reason (40%) given by those that are looking forward to voting is “to exercise their right to vote as Nigerians”, while the most popular reason by those who do not look forward to voting (36%) is that “their vote will not count”. Finally, as the 2015 elections draw closer, it’s clear that voter apathy will be a major deterrent since the majority of Nigerians are currently highly enthusiastic about voting in the upcoming 2015 polls.

Survey Methods
The opinion poll was conducted in February 17th to 19th 2014. It involved telephone interviews of a random nationwide sample. 1,000 randomly selected phone-owning Nigerians aged 18 years and above, representing the six geopolitical zones in the country, were interviewed. With a sample of this size, we can say with 95% confidence that the results obtained are statistically precise - within a range of plus or minus 3%. NOIPolls Limited, No. 1 for country-specific polling services in West Africa, which works in technical partnership with the Gallup Organisation (USA), to conduct periodic opinion polls and studies on various socio-economic and political issues in Nigeria. More information is available at 
www.noi-polls.com
 

_________________________________________________________________
Disclaimer
This press release has been produced by NOIPolls Limited to provide information on all issues which form the subject matter of the document. Kindly note that while we are willing to share results from our polls with the general public, we only request that NOIPolls be acknowledged as author whenever and wherever our poll results are used, cited or published.

NOIPolls hereby certifies that all the views expressed in this document accurately reflect its views of respondents surveyed for the poll, and background information is based on information from various sources that it believes are reliable; however, no representation is made that it is accurate or complete. Whilst reasonable care has been taken in preparing this document, no responsibility or liability is accepted for errors or fact or for any views expressed herein by NOIPolls for actions taken as a result of information provided in this report. Any ratings, forecasts, estimates, opinions or views herein constitute a judgment as at the date of this document. If the date of this document is not current, the views and content may not reflect NOIPolls’ current findings and/or thinking.
 
_______________________________________________________________________
Press Contact
The Editor
Email: editor@noi-polls.com


Monday, February 24, 2014

WhatsApp To Add Voice Calls After Facebook Acquisition

By Leila Abboud and Eric Auchard
BARCELONA (Reuters) - WhatsApp will add free voice-call services for its 450 million customers later this year, laying down a new challenge to telecom network operators just days afterFacebook Inc scooped it up for $19 billion.
The text-based messaging service aims to let users make calls by the second quarter, expanding its appeal to help it hit a billion users, WhatsApp CEO Jan Koum said at the Mobile World Congress in Barcelona on Monday.
Buying WhatsApp has cemented Facebook's involvement in messaging, which for many people is their earliest experience with the mobile Internet. Adding voice services moves the social network into another core function on a smartphone.
On Monday, Chief Executive Mark Zuckerberg defended the price paid for a messaging service with negligible revenue. He argued that rival services such as South Korea's KakaoTalk and Naver's LINE are already "monetizing" at a rate of $2 to $3 in revenue per user per year, despite being in the early stages of growth.
Media reports put WhatsApp's revenue at about $20 million in 2013.
"I actually think that by itself it's worth more than 19 billion," Zuckerberg told the Mobile World Congress. "Even just independently, I think it's a good bet."
"By being a part of Facebook, it makes it so they can focus for the next five years or so purely on adding more people."
WhatsApp's move into voice calls is unlikely to sit well with telecoms carriers.
WhatsApp and its rivals, like KakaoTalk, China's WeChat, and Viber, have won over telecom operators' customers in recent years by offering a free option to text messaging. Telecom providers globally generated revenue of about $120 billion from text messaging last year, according to market researcher Ovum.
Adding free calls threatens another telecom revenue source, which has been declining anyway as carriers' tweak tariffs to focus on mobile data instead of calls.
WITH, NOT AGAINST
Since the advent a decade ago of Skype's voice over Internet service, which Microsoft Corp has acquired, and the rise of Internet service providers like Google Inc, telecom bosses have gotten used to facing challengers whose services piggyback on their networks. But carriers complain that the rivals are not subject to the same national regulations.
Mats Granryd, the CEO of Swedish mobile operator Tele2, said he was happy to partner with the likes of WhatsApp because of the additional data traffic they generate. But he shared the concerns of other network operators that they must operate under strict national regulations that Internet companies are not subject to.
"They (Internet firms) need to be regulated a little bit more and we need to be regulated a little bit less," said Jo Lunder, who heads Russian mobile network operator VimpelCom.
Vodafone CEO Vittorio Colao said he did not understand how such an important acquisition as the Facebook-WhatsApp deal could go unchallenged at a time when European network operators were facing intense regulatory scrutiny.
"These types of deal are a clear indication that the world is changing and the regulations don't fit anymore," Colao said on the sidelines of the conference.
Both Facebook and WhatsApp CEOs have cast themselves as partners to telecoms network operators.
On Monday, Koum also announced a partnership with E-Plus, the German subsidiary of Dutch group KPN, under which it will launch a WhatsApp-branded mobile service in Germany.
The European Parliament is set to vote on Monday night on a package of proposed telecoms market reforms which among other provisions would restrict the ability of carriers to charge internet companies like Facebook to give them an enhanced service in handling their network traffic.
(Additional reporting by Kate Holton; Editing by Greg Mahlich)


Technology Giant Samsung Launches New Galaxy S5

Reported by Press Association

Photo Credit: Daily Mail 
Samsung sought to frame its new Galaxy S5 smartphone as a lifestyle product as it emphasised a built-in heart-rate sensor and improved camera features over its slightly larger size.
One of the main appeals of Samsung phones has been their size. The screen has steadily increased since the 4in (10.2 centimetres) on the original S from 2010, while the iPhone made that jump to 4in only in 2012 and has stayed that way since.
But the S5 pushes the screen to only 5.1in (13cm), measured diagonally, from 5in (12.7cm) in last year's model. Instead of size, Samsung touted the new phone's ability to adapt its screen to changing external conditions and to dim it to avoid disturbing others nearby.
The phone has a 16 megapixel camera, sharper than the 13 megapixels in its predecessor. It promises faster auto focus and the ability to blur the foreground or background of an image to emphasise a subject.
Samsung made the latest announcement during the Mobile World Congress wireless show in Barcelona, Spain.
The new phone will go on sale worldwide on April 11. The company did not announce a price.
The S5 has a fingerprint sensor to use in place of a passcode to unlock the phone or make payments through PayPal. It is a feature still rare in phones, though Apple introduced it in last autumn's iPhone 5s.
Samsung's Galaxy S series has emerged as one of the strongest challengers to Apple's iPhones and has helped the Korean company surpass Apple as the world's largest smartphone maker. According to Gartner, Samsung's smartphones had a worldwide market share of 31% last year, compared with 16% for Apple's iPhones.
A chief complaint about Samsung phones has been the company's tendency to pack them with a slew of features, some of which do not work well with each other or at all. Recent phones have sported an Easy Mode, with larger icons and fewer customisation choices. It is as though Samsung acknowledges that its devices have become too complex for many people to use.
Samsung showed restraint this time.
"Samsung is betting big on wellness, fingerprint reading and camera autofocus, while keeping a very similar look and feel for its hardware and software," said Nick Dillon, a senior analyst at the research firm Ovum. "The updates are so minor that on first glance most consumers would be hard pressed to notice that it has changed from the previous version."
But he said that is to be expected "given the maturity of the smartphone market and the pressure on Samsung not to mess with its winning formula".
The heart-rate sensor on the S5 can be used before and after exercise to measure fitness activities. It is not meant for continuous tracking. Samsung also unveiled a fitness band, Gear Fit, to complement two new computerised watches announced on Sunday. Those will be available on April 11 as well.
"These devices are Samsung's commitment and vision to great experiences that matter the most to us all," Samsung European executive Jean-Daniel Ayme said.
Parents, meanwhile, will enjoy the ability to hand the phone to a child without worry. Just place it in a child's mode and only approved apps can be accessed. Your child cannot send your boss an email or post an embarrassing picture on Facebook when all you intended was to have your child play Candy Crush Saga.
The phone is also water resistant.
"Our consumers do not want eye-popping technology or the most complex technology," said JK Shin, Samsung's head of information technology and the mobile communications division. "Our consumers want durable design and performance. Our consumers want a simple, yet powerful camera."


New Post: Network Router Caused WhatsApp's 'Biggest' Outage

Photo Credit  KnowYourMobile
(Reuters) - WhatsApp founder Jan Koum on Sunday issued an apology and blamed a network router for Saturday's outage of the mobile messaging app.
"We are sorry about the downtime," wrote Koum. "It has been our longest and biggest outage in years. It was caused by a network router fault which cascaded into our servers."
"We worked with our service provider on resolving the issue and making sure it will not happen again."
WhatsApp was down for more than three hours on Saturday just days after Facebook bought it for $19 billion.
The five-year old company currently has about 450 million users worldwide and is the leading smartphone-based messaging app.
(Reporting by Jennifer Saba in New York; Editing by Marguerita Choy)

New Post: LG Plans Launch Of First Smartwatch

(Press Association) LG Electronics has said it will launch a computerised wristwatch later this year, entering a nascent market where Samsung, Sony and smaller companies such as Pebble are already jostling for dominance.
Park Jong-seok, president of LG's mobile communications division, said early smartwatch models failed to demonstrate why consumers should buy them. He said LG's strategy is not to release a half-baked product but, like other smartwatches, the LG smartwatch will be paired with a smartphone.
The South Korean company announced its smartwatch plans at a mobile industry fair in Barcelona, Spain. Mr Park made his comments during a pre-announcement briefing last week.
LG was a late comer in both smartphones and tablets compared with its home rival Samsung, now the world's largest maker of smartphones.
LG spokeswoman Kim So-yeong declined to comment on news reports that LG will manufacture an Android-powered smartwatch for Google. LG already makes some of Google's Nexus mobile products.
Part of LG's efforts to boost its mobile brand in the crucial North American market was to collaborate with Google. It manufactured Google's Nexus 5 smartphone, the first mobile device to be powered by KitKat, which is the latest version of Google's Android operating system, and the Nexus 4 smartphone.
LG Electronics finished 2013 as a fourth-largest smartphone maker in the world according to research firm Gartner. But the No 4 title does not mean its business is profitable.
LG's mobile division is among the distant second-tier group in the market where nearly all profit is taken by the two leading companies - Samsung and Apple. LG lost 58.5 million US dollars (£35.2 million) in the final three months of 2013 due to hefty marketing costs and falling smartphone prices.
Samsung, which sold 1 million Android-powered Galaxy Gear smartwatches to retailers and mobile carriers last year, dropped Google's Android in its latest announcement of smartwatches.
Samsung unveiled two new smartwatches yesterday on the eve of the Mobile World Congress in Barcelona. Both are powered by lesser-known operating system called Tizen, developed jointly by Samsung and Intel Corp.


New Post: Netflix To Pay Comcast For Faster Speeds

By Reuters

Photo Credit: BusinessWeek
(Reuters) - Netflix has agreed to pay one of the largest broadband providers in the United States Comcast Corp for faster speeds, throwing open the possibility that more content companies will have to shell out for better service.
Comcast and Netflix made the joint announcement on Sunday, marking the first time that Netflix is paying for faster speeds in the U.S. after customers complained about slow service. Terms of the deal were not disclosed.
The arrangement comes as federal regulators are wrestling with an issue known as "Net neutrality" concerning broadband providers and whether they can slow down traffic to particular websites, potentially forcing content companies to pay for faster Web service.
The Federal Communications Commission said last week it plans to rewrite the rules after a U.S. court struck down the commission's previous version.
The issue is being closely watched as millions of people view movies and TV shows through streaming services offered by such companies like Netflix and Amazon.
Netflix, which got its start as a DVD-by-mail service, has 44 million subscribers worldwide and 34 million in the U.S. alone. About 7 million subscribers pay for mail delivery services.
The companies said in a statement that they have been "working collaboratively over many months" to strike a multi-year agreement. Netflix will not receive preferential network treatment, the companies said.
As part of the deal, Netflix will deliver its movies and TV programs to Comcast's broadband network directly as opposed through third party providers, giving viewers faster streaming speeds for watching movies and TV programs.
It also could force Netflix to strike similar arrangements, known in the industry as interconnect agreements, with other major broadband providers like Verizon and AT&T.
With more than 44 million subscribers throughout the world, Netflix has been making an effort to connect directly with broadband Internet providers. It has struck similar deals with Cablevision and Cox, though Netflix did not pay for these connections.
The arrangement with Netflix comes on the heels of Comcast's agreement to buy Time Warner Cable for $45 billion (27 billion pounds), a deal that will draw the scrutiny of U.S. antitrust enforcers.
The combined company would have a near 30 percent share of the U.S. pay television market, as well as be the major provider of broadband Internet access.
(Reporting by Jennifer Saba in New York; Editing by Meredith Mazzilli)

Wednesday, February 19, 2014

Registration For #OpenData Day Abuja Is Still On!

By YouthhuAfrica

Photo Credit: Wikipedia
Open Data Day is a global community initiative to make and spread open data. People from all around the world gather together on-line or in person to make things with and around open data.
The #ODDAbuja is organized by Follow The Money and you can Join us at CODE, Bassan Plaza, Plot 759, 2nd Floor, F Wing, Behind Total House, Central Business District Abuja on Saturday, February 22, 2014.

The OpenDataDay in Abuja will kick-start our Data Expedition Class [Training on Data and its use]; Update our Education Budget Tracker [ geo-locate higher institutions and funds meant for them], geo-locate primary schools in Nigeria and also get us on a trip to the digital humanitarian world while having fun – Bring your laptops, coffee mugs, and power extension cords.


Deadline: February 20, 2014 [23.00 GMT+1]

To register for participation and for more invitation visit here

Tuesday, February 18, 2014

Make Use Of Your Business Data By Karo Orovboni

By Karo Orovboni

Karo Orovboni
There is an increasing need for businesses to thrive in a highly competitive environment. The ubiquitous Internet has made competition in the business space more rife, with companies trying to gain any sort of advantage over the other. Business decision makers now have to make highly informed, transparent, and accurate decisions about their organisation. They will need to analyse growth based on Key Performance Indicators (KPI), compare their organisation against other competitors, forecast certain metrics, and analyse customer trends in other to remain relevant in the business space.

Your data (daily collection of business activities) is what you need in making these effective and well-informed decisions. If you want growth in your company, it is best to start looking at ways to make use of data. Your data is your business’ most valuable asset; effective use of it could determine where your company finds itself on the success spectrum. As competition continues to stiffen, the establishment that will succeed in the long term is the one that is able to exploit the power of data, turning data into meaningful knowledge for the growth of the business.

According to IBM, “90% of the data in the world today has been created in the last two years alone”! Data is the big deal; data has become the de facto key business decision-making tool. Regardless of your business area/industry, your data is highly critical to your company’s success. The difference between where your business is now and where you would like it to be may just be in your data. Analysing your business data will help you know the products that are generating profit and the ones that are not doing so well. The solution to the stunted business growth you are looking for out there is within – your data.
A retail company for example, that specializes in multi-departmental and grocery items, and already exploits the power of data has put itself in a good stead of succeeding in the long term. The company will be fortified with accurate and instant information to efficiently manage its supply chain, sales, improve customer relationship, effectively manage various store operations and forecast business growth.

The company can use its data to personalise sale and promotions to customers. Let us take for example a customer who visits this retail store, unbeknownst to the store, the customer has just had a baby but as the store now run an effective data management system, and have just begun a customer retention scheme where they reward loyal customers, they immediately get the benefit of utilizing their data.  The customer buys lots of baby related items, and scans their customer card at the till. The customer purchases are worked out and a voucher is provided for the customers’ next visit, with discounts on the product range just purchased. The customer leaves the store, happy to get discounts on baby items and will be more likely to return to the store. The company would have retained their customer, and at the same time driven sales through effective use of data.

The primary aim for most businesses is to sell as many products as possible, as the more products you sell, the more profit you make. Harnessing the full potentials of your business data is key in making this work. The benefit of this would seem like magic, but it is not, you are just utilizing your business assets.

Excellent customer service and customer retention policy are important elements required for business growth. You may want to ask yourself a question, ‘why will customers want to come back here?’ Your data will help you create that value, retain, and manage your customers, thereby reducing the rate of customers churning. You need to retain existing customers, and not just concentrate on advertisements to attract new customers, whilst neglecting your existing customers. Your business will possibly come up in your customers’ casual conversations with their friends and family. What they say about your business depends on how you have treated them.

Be sure to know that you would have potentially gained yourself tens, hundreds, or even thousands of customers just by treating your customers right and giving them the utmost customer satisfaction. People are more willing to give your business a try if they have received positive feedback from a friend or family. The old age ‘word of mouth’ method still works in today’s business environment. Remember, you do not have the monopoly of the business area you run, you have competitors, and so your customers have options.
A majority of business owners or managers still do not have unequivocal knowledge of how their businesses are performing. It would not be far fetched that their data has been ignored. According to a report by McKinsey Global Institute, a retailer making optimal use of data could increase its operating margin by more than 60 per cent! If you were to make efficient use of your data, you could drive operational efficiency and profitability, reducing expenditure and errors. Your data will give you information about your business, information will garner knowledge, knowledge will enable you do the right thing and make the correct business decisions.

Data is not an illusion; it is a key factor for productivity and profit optimisation, it cannot be neglected. Whether you run a small and medium-sized enterprise (SME) or a multi-national organisation, your data is the tool that is needed to drive sales and increase productivity. If for any reason you are yet to start making use of your business data, there is no better time to start than now.

Article read from Omojuwa.com

Karo is a Business Intelligence/Data Warehouse professional; you can engage him on twitter @K_Orovboni


New Post: The Rise of Big Data

Photo Credit: IntelligenceGuy
Everyone knows that the Internet has changed how businesses operate, governments function, and people live. But a new, less visible technological trend is just as transformative: “big data.” Big data starts with the fact that there is a lot more information floating around these days than ever before, and it is being put to extraordinary new uses. Big data is distinct from the Internet, although the Web makes it much easier to collect and share data. Big data is about more than just communication: the idea is that we can learn from a large body of information things that we could not comprehend when we used only smaller amounts.
In the third century BC, the Library of Alexandria was believed to house the sum of human knowledge. Today, there is enough information in the world to give every person alive 320 times as much of it as historians think was stored in Alexandria’s entire collection -- an estimated 1,200 exabytes’ worth. If all this information were placed on CDs and they were stacked up, the CDs would form five separate piles that would all reach to the moon.
This explosion of data is relatively new. As recently as the year 2000, only one-quarter of all the world’s stored information was digital. The rest was preserved on paper, film, and other analog media. But because the amount of digital data expands so quickly -- doubling around every three years -- that situation was swiftly inverted. Today, less than two percent of all stored information is nondigital.
Given this massive scale, it is tempting to understand big data solely in terms of size. But that would be misleading. Big data is also characterized by the ability to render into data many aspects of the world that have never been quantified before; call it “datafication.” For example, location has been datafied, first with the invention of longitude and latitude, and more recently with GPS satellite systems. Words are treated as data when computers mine centuries’ worth of books. Even friendships and “likes” are datafied, via Facebook.
We can learn from a large body of information things that we could not comprehend when we used only smaller amounts.
This kind of data is being put to incredible new uses with the assistance of inexpensive computer memory, powerful processors, smart algorithms, clever software, and math that borrows from basic statistics. Instead of trying to “teach” a computer how to do things, such as drive a car or translate between languages, which artificial-intelligence experts have tried unsuccessfully to do for decades, the new approach is to feed enough data into a computer so that it can infer the probability that, say, a traffic light is green and not red or that, in a certain context, lumière is a more appropriate substitute for “light” than léger.
Using great volumes of information in this way requires three profound changes in how we approach data. The first is to collect and use a lot of data rather than settle for small amounts or samples, as statisticians have done for well over a century. The second is to shed our preference for highly curated and pristine data and instead accept messiness: in an increasing number of situations, a bit of inaccuracy can be tolerated, because the benefits of using vastly more data of variable quality outweigh the costs of using smaller amounts of very exact data. Third, in many instances, we will need to give up our quest to discover the cause of things, in return for accepting correlations. With big data, instead of trying to understand precisely why an engine breaks down or why a drug’s side effect disappears, researchers can instead collect and analyze massive quantities of information about such events and everything that is associated with them, looking for patterns that might help predict future occurrences. Big data helps answer what, not why, and often that’s good enough.
The Internet has reshaped how humanity communicates. Big data is different: it marks a transformation in how society processes information. In time, big data might change our way of thinking about the world. As we tap ever more data to understand events and make decisions, we are likely to discover that many aspects of life are probabilistic, rather than certain.
APPROACHING "N=ALL"
For most of history, people have worked with relatively small amounts of data because the tools for collecting, organizing, storing, and analyzing information were poor. People winnowed the information they relied on to the barest minimum so that they could examine it more easily. This was the genius of modern-day statistics, which first came to the fore in the late nineteenth century and enabled society to understand complex realities even when little data existed. Today, the technical environment has shifted 179 degrees. There still is, and always will be, a constraint on how much data we can manage, but it is far less limiting than it used to be and will become even less so as time goes on.
The way people handled the problem of capturing information in the past was through sampling. When collecting data was costly and processing it was difficult and time consuming, the sample was a savior. Modern sampling is based on the idea that, within a certain margin of error, one can infer something about the total population from a small subset, as long the sample is chosen at random. Hence, exit polls on election night query a randomly selected group of several hundred people to predict the voting behavior of an entire state. For straightforward questions, this process works well. But it falls apart when we want to drill down into subgroups within the sample. What if a pollster wants to know which candidate single women under 30 are most likely to vote for? How about university-educated, single Asian American women under 30? Suddenly, the random sample is largely useless, since there may be only a couple of people with those characteristics in the sample, too few to make a meaningful assessment of how the entire subpopulation will vote. But if we collect all the data -- “n = all,” to use the terminology of statistics -- the problem disappears.
This example raises another shortcoming of using some data rather than all of it. In the past, when people collected only a little data, they often had to decide at the outset what to collect and how it would be used. Today, when we gather all the data, we do not need to know beforehand what we plan to use it for. Of course, it might not always be possible to collect all the data, but it is getting much more feasible to capture vastly more of a phenomenon than simply a sample and to aim for all of it. Big data is a matter not just of creating somewhat larger samples but of harnessing as much of the existing data as possible about what is being studied. We still need statistics; we just no longer need to rely on small samples.
There is a tradeoff to make, however. When we increase the scale by orders of magnitude, we might have to give up on clean, carefully curated data and tolerate some messiness. This idea runs counter to how people have tried to work with data for centuries. Yet the obsession with accuracy and precision is in some ways an artifact of an information-constrained environment. When there was not that much data around, researchers had to make sure that the figures they bothered to collect were as exact as possible. Tapping vastly more data means that we can now allow some inaccuracies to slip in (provided the data set is not completely incorrect), in return for benefiting from the insights that a massive body of data provides.
Consider language translation. It might seem obvious that computers would translate well, since they can store lots of information and retrieve it quickly. But if one were to simply substitute words from a French-English dictionary, the translation would be atrocious. Language is complex. A breakthrough came in the 1990s, when IBM delved into statistical machine translation. It fed Canadian parliamentary transcripts in both French and English into a computer and programmed it to infer which word in one language is the best alternative for another. This process changed the task of translation into a giant problem of probability and math. But after this initial improvement, progress stalled.
Then Google barged in. Instead of using a relatively small number of high-quality translations, the search giant harnessed more data, but from the less orderly Internet -- “data in the wild,” so to speak. Google inhaled translations from corporate websites, documents in every language from the European Union, even translations from its giant book-scanning project. Instead of millions of pages of texts, Google analyzed billions. The result is that its translations are quite good -- better than IBM’s were--and cover 65 languages. Large amounts of messy data trumped small amounts of cleaner data.
Using big data will sometimes mean forgoing the quest for why in return for knowing what.
FROM CAUSATION TO CORRELATION
These two shifts in how we think about data -- from some to all and from clean to messy -- give rise to a third change: from causation to correlation. This represents a move away from always trying to understand the deeper reasons behind how the world works to simply learning about an association among phenomena and using that to get things done.
Of course, knowing the causes behind things is desirable. The problem is that causes are often extremely hard to figure out, and many times, when we think we have identified them, it is nothing more than a self-congratulatory illusion. Behavioral economics has shown that humans are conditioned to see causes even where none exist. So we need to be particularly on guard to prevent our cognitive biases from deluding us; sometimes, we just have to let the data speak.
Take UPS, the delivery company. It places sensors on vehicle parts to identify certain heat or vibrational patterns that in the past have been associated with failures in those parts. In this way, the company can predict a breakdown before it happens and replace the part when it is convenient, instead of on the side of the road. The data do not reveal the exact relationship between the heat or the vibrational patterns and the part’s failure. They do not tell UPS why the part is in trouble. But they reveal enough for the company to know what to do in the near term and guide its investigation into any underlying problem that might exist with the part in question or with the vehicle.
A similar approach is being used to treat breakdowns of the human machine. Researchers in Canada are developing a big-data approach to spot infections in premature babies before overt symptoms appear. By converting 16 vital signs, including heartbeat, blood pressure, respiration, and blood-oxygen levels, into an information flow of more than 1,000 data points per second, they have been able to find correlations between very minor changes and more serious problems. Eventually, this technique will enable doctors to act earlier to save lives. Over time, recording these observations might also allow doctors to understand what actually causes such problems. But when a newborn’s health is at risk, simply knowing that something is likely to occur can be far more important than understanding exactly why.
Medicine provides another good example of why, with big data, seeing correlations can be enormously valuable, even when the underlying causes remain obscure. In February 2009, Google created a stir in health-care circles. Researchers at the company published a paper in Nature that showed how it was possible to track outbreaks of the seasonal flu using nothing more than the archived records of Google searches. Google handles more than a billion searches in the United States every day and stores them all. The company took the 50 million most commonly searched terms between 2003 and 2008 and compared them against historical influenza data from the Centers for Disease Control and Prevention. The idea was to discover whether the incidence of certain searches coincided with outbreaks of the flu -- in other words, to see whether an increase in the frequency of certain Google searches conducted in a particular geographic area correlated with the CDC’s data on outbreaks of flu there. The CDC tracks actual patient visits to hospitals and clinics across the country, but the information it releases suffers from a reporting lag of a week or two -- an eternity in the case of a pandemic. Google’s system, by contrast, would work in near-real time.
Google did not presume to know which queries would prove to be the best indicators. Instead, it ran all the terms through an algorithm that ranked how well they correlated with flu outbreaks. Then, the system tried combining the terms to see if that improved the model. Finally, after running nearly half a billion calculations against the data, Google identified 45 terms -- words such as “headache” and “runny nose” -- that had a strong correlation with the CDC’s data on flu outbreaks. All 45 terms related in some way to influenza. But with a billion searches a day, it would have been impossible for a person to guess which ones might work best and test only those.
Moreover, the data were imperfect. Since the data were never intended to be used in this way, misspellings and incomplete phrases were common. But the sheer size of the data set more than compensated for its messiness. The result, of course, was simply a correlation. It said nothing about the reasons why someone performed any particular search. Was it because the person felt ill, or heard sneezing in the next cubicle, or felt anxious after reading the news? Google’s system doesn’t know, and it doesn’t care. Indeed, last December, it seems that Google’s system may have overestimated the number of flu cases in the United States. This serves as a reminder that predictions are only probabilities and are not always correct, especially when the basis for the prediction -- Internet searches -- is in a constant state of change and vulnerable to outside influences, such as media reports. Still, big data can hint at the general direction of an ongoing development, and Google’s system did just that.
BACK-END OPERATIONS
Many technologists believe that big data traces its lineage back to the digital revolution of the 1980s, when advances in microprocessors and computer memory made it possible to analyze and store ever more information. That is only superficially the case. Computers and the Internet certainly aid big data by lowering the cost of collecting, storing, processing, and sharing information. But at its heart, big data is only the latest step in humanity’s quest to understand and quantify the world. To appreciate how this is the case, it helps to take a quick look behind us.
There will be a special need to carve out a place for the human: to reserve space for intuition, common sense, and serendipity.
Appreciating people’s posteriors is the art and science of Shigeomi Koshimizu, a professor at the Advanced Institute of Industrial Technology in Tokyo. Few would think that the way a person sits constitutes information, but it can. When a person is seated, the contours of the body, its posture, and its weight distribution can all be quantified and tabulated. Koshimizu and his team of engineers convert backsides into data by measuring the pressure they exert at 360 different points with sensors placed in a car seat and by indexing each point on a scale of zero to 256. The result is a digital code that is unique to each individual. In a trial, the system was able to distinguish among a handful of people with 98 percent accuracy.
The research is not asinine. Koshimizu’s plan is to adapt the technology as an antitheft system for cars. A vehicle equipped with it could recognize when someone other than an approved driver sat down behind the wheel and could demand a password to allow the car to function. Transforming sitting positions into data creates a viable service and a potentially lucrative business. And its usefulness may go far beyond deterring auto theft. For instance, the aggregated data might reveal clues about a relationship between drivers’ posture and road safety, such as telltale shifts in position prior to accidents. The system might also be able to sense when a driver slumps slightly from fatigue and send an alert or automatically apply the brakes.
Koshimizu took something that had never been treated as data -- or even imagined to have an informational quality -- and transformed it into a numerically quantified format. There is no good term yet for this sort of transformation, but “datafication” seems apt. Datafication is not the same as digitization, which takes analog content -- books, films, photographs -- and converts it into digital information, a sequence of ones and zeros that computers can read. Datafication is a far broader activity: taking all aspects of life and turning them into data. Google’s augmented-reality glasses datafy the gaze. Twitter datafies stray thoughts. LinkedIn datafies professional networks.
Once we datafy things, we can transform their purpose and turn the information into new forms of value. For example, IBM was granted a U.S. patent in 2012 for “securing premises using surface-based computing technology” -- a technical way of describing a touch-sensitive floor covering, somewhat like a giant smartphone screen. Datafying the floor can open up all kinds of possibilities. The floor could be able to identify the objects on it, so that it might know to turn on lights in a room or open doors when a person entered. Moreover, it might identify individuals by their weight or by the way they stand and walk. It could tell if someone fell and did not get back up, an important feature for the elderly. Retailers could track the flow of customers through their stores. Once it becomes possible to turn activities of this kind into data that can be stored and analyzed, we can learn more about the world -- things we could never know before because we could not measure them easily and cheaply.
BIG DATA IN THE BIG APPLE
Big data will have implications far beyond medicine and consumer goods: it will profoundly change how governments work and alter the nature of politics. When it comes to generating economic growth, providing public services, or fighting wars, those who can harness big data effectively will enjoy a significant edge over others. So far, the most exciting work is happening at the municipal level, where it is easier to access data and to experiment with the information. In an effort spearheaded by New York City Mayor Michael Bloomberg (who made a fortune in the data business), the city is using big data to improve public services and lower costs. One example is a new fire-prevention strategy.
Illegally subdivided buildings are far more likely than other buildings to go up in flames. The city gets 25,000 complaints about overcrowded buildings a year, but it has only 200 inspectors to respond. A small team of analytics specialists in the mayor’s office reckoned that big data could help resolve this imbalance between needs and resources. The team created a database of all 900,000 buildings in the city and augmented it with troves of data collected by 19 city agencies: records of tax liens, anomalies in utility usage, service cuts, missed payments, ambulance visits, local crime rates, rodent complaints, and more. Then, they compared this database to records of building fires from the past five years, ranked by severity, hoping to uncover correlations. Not surprisingly, among the predictors of a fire were the type of building and the year it was built. Less expected, however, was the finding that buildings obtaining permits for exterior brickwork correlated with lower risks of severe fire.
Using all this data allowed the team to create a system that could help them determine which overcrowding complaints needed urgent attention. None of the buildings’ characteristics they recorded caused fires; rather, they correlated with an increased or decreased risk of fire. That knowledge has proved immensely valuable: in the past, building inspectors issued vacate orders in 13 percent of their visits; using the new method, that figure rose to 70 percent -- a huge efficiency gain.
Of course, insurance companies have long used similar methods to estimate fire risks, but they mainly rely on only a handful of attributes and usually ones that intuitively correspond with fires. By contrast, New York City’s big-data approach was able to examine many more variables, including ones that would not at first seem to have any relation to fire risk. And the city’s model was cheaper and faster, since it made use of existing data. Most important, the big-data predictions are probably more on target, too.
Big data is also helping increase the transparency of democratic governance. A movement has grown up around the idea of “open data,” which goes beyond the freedom-of-information laws that are now commonplace in developed democracies. Supporters call on governments to make the vast amounts of innocuous data that they hold easily available to the public. The United States has been at the forefront, with its Data.gov website, and many other countries have followed.
At the same time as governments promote the use of big data, they will also need to protect citizens against unhealthy market dominance. Companies such as Google, Amazon, and Facebook -- as well as lesser-known “data brokers,” such as Acxiom and Experian -- are amassing vast amounts of information on everyone and everything. Antitrust laws protect against the monopolization of markets for goods and services such as software or media outlets, because the sizes of the markets for those goods are relatively easy to estimate. But how should governments apply antitrust rules to big data, a market that is hard to define and that is constantly changing form? Meanwhile, privacy will become an even bigger worry, since more data will almost certainly lead to more compromised private information, a downside of big data that current technologies and laws seem unlikely to prevent.
Regulations governing big data might even emerge as a battleground among countries. European governments are already scrutinizing Google over a raft of antitrust and privacy concerns, in a scenario reminiscent of the antitrust enforcement actions the European Commission took against Microsoft beginning a decade ago. Facebook might become a target for similar actions all over the world, because it holds so much data about individuals. Diplomats should brace for fights over whether to treat information flows as similar to free trade: in the future, when China censors Internet searches, it might face complaints not only about unjustly muzzling speech but also about unfairly restraining commerce.
BIG DATA OR BIG BROTHER?
States will need to help protect their citizens and their markets from new vulnerabilities caused by big data. But there is another potential dark side: big data could become Big Brother. In all countries, but particularly in nondemocratic ones, big data exacerbates the existing asymmetry of power between the state and the people.
The asymmetry could well become so great that it leads to big-data authoritarianism, a possibility vividly imagined in science-fiction movies such as Minority Report. That 2002 film took place in a near-future dystopia in which the character played by Tom Cruise headed a “Precrime” police unit that relied on clairvoyants whose visions identified people who were about to commit crimes. The plot revolves around the system’s obvious potential for error and, worse yet, its denial of free will.
Although the idea of identifying potential wrongdoers before they have committed a crime seems fanciful, big data has allowed some authorities to take it seriously. In 2007, the Department of Homeland Security launched a research project called FAST (Future Attribute Screening Technology), aimed at identifying potential terrorists by analyzing data about individuals’ vital signs, body language, and other physiological patterns. Police forces in many cities, including Los Angeles, Memphis, Richmond, and Santa Cruz, have adopted “predictive policing” software, which analyzes data on previous crimes to identify where and when the next ones might be committed.
For the moment, these systems do not identify specific individuals as suspects. But that is the direction in which things seem to be heading. Perhaps such systems would identify which young people are most likely to shoplift. There might be decent reasons to get so specific, especially when it comes to preventing negative social outcomes other than crime. For example, if social workers could tell with 95 percent accuracy which teenage girls would get pregnant or which high school boys would drop out of school, wouldn’t they be remiss if they did not step in to help? It sounds tempting. Prevention is better than punishment, after all. But even an intervention that did not admonish and instead provided assistance could be construed as a penalty -- at the very least, one might be stigmatized in the eyes of others. In this case, the state’s actions would take the form of a penalty before any act were committed, obliterating the sanctity of free will.
Another worry is what could happen when governments put too much trust in the power of data. In his 1999 book, Seeing Like a State, the anthropologist James Scott documented the ways in which governments, in their zeal for quantification and data collection, sometimes end up making people’s lives miserable. They use maps to determine how to reorganize communities without first learning anything about the people who live there. They use long tables of data about harvests to decide to collectivize agriculture without knowing a whit about farming. They take all the imperfect, organic ways in which people have interacted over time and bend them to their needs, sometimes just to satisfy a desire for quantifiable order.
This misplaced trust in data can come back to bite. Organizations can be beguiled by data’s false charms and endow more meaning to the numbers than they deserve. That is one of the lessons of the Vietnam War. U.S. Secretary of Defense Robert McNamara became obsessed with using statistics as a way to measure the war’s progress. He and his colleagues fixated on the number of enemy fighters killed. Relied on by commanders and published daily in newspapers, the body count became the data point that defined an era. To the war’s supporters, it was proof of progress; to critics, it was evidence of the war’s immorality. Yet the statistics revealed very little about the complex reality of the conflict. The figures were frequently inaccurate and were of little value as a way to measure success. Although it is important to learn from data to improve lives, common sense must be permitted to override the spreadsheets.
HUMAN TOUCH
Big data is poised to reshape the way we live, work, and think. A worldview built on the importance of causation is being challenged by a preponderance of correlations. The possession of knowledge, which once meant an understanding of the past, is coming to mean an ability to predict the future. The challenges posed by big data will not be easy to resolve. Rather, they are simply the next step in the timeless debate over how to best understand the world.
Still, big data will become integral to addressing many of the world’s pressing problems. Tackling climate change will require analyzing pollution data to understand where best to focus efforts and find ways to mitigate problems. The sensors being placed all over the world, including those embedded in smartphones, provide a wealth of data that will allow climatologists to more accurately model global warming. Meanwhile, improving and lowering the cost of health care, especially for the world’s poor, will make it necessary to automate some tasks that currently require human judgment but could be done by a computer, such as examining biopsies for cancerous cells or detecting infections before symptoms fully emerge.
Ultimately, big data marks the moment when the “information society” finally fulfills the promise implied by its name. The data take center stage. All those digital bits that have been gathered can now be harnessed in novel ways to serve new purposes and unlock new forms of value. But this requires a new way of thinking and will challenge institutions and identities. In a world where data shape decisions more and more, what purpose will remain for people, or for intuition, or for going against the facts? If everyone appeals to the data and harnesses big-data tools, perhaps what will become the central point of differentiation is unpredictability: the human element of instinct, risk taking, accidents, and even error. If so, then there will be a special need to carve out a place for the human: to reserve space for intuition, common sense, and serendipity to ensure that they are not crowded out by data and machine-made answers.
This has important implications for the notion of progress in society. Big data enables us to experiment faster and explore more leads. These advantages should produce more innovation. But at times, the spark of invention becomes what the data do not say. That is something that no amount of data can ever confirm or corroborate, since it has yet to exist. If Henry Ford had queried big-data algorithms to discover what his customers wanted, they would have come back with “a faster horse,” to recast his famous line. In a world of big data, it is the most human traits that will need to be fostered -- creativity, intuition, and intellectual ambition -- since human ingenuity is the source of progress.
Big data is a resource and a tool. It is meant to inform, rather than explain; it points toward understanding, but it can still lead to misunderstanding, depending on how well it is wielded. And however dazzling the power of big data appears, its seductive glimmer must never blind us to its inherent imperfections. Rather, we must adopt this technology with an appreciation not just of its power but also of its limitations.
Article published by the Council on Foreign Affairs