Product Review: Informatica Addresses The Impending Big Data Challenge With Release 9.1

June 13, 2011

Big Data Emerges As A Challenge In A World Of Unstructured Data Proliferation
Data volumes continue to explode with a proliferation of devices, social media tools, video usage, and emerging forms of both structured and unstructured data. The rate of data explosion may be occurring faster than Moore's Law. Organizations now face a significant challenge in dealing with this data deluge. Across the 5 pillars of consumer tech effecting enterprise software, organizations must deal with:

Managing unstructured user generated social interaction data. Massive smartphone adoption and social network usage will converge to create massive data volumes. Twitter now has 106 million users generating over 3 billion requests per day. Most analysts firms forecast at least 300 million smart phones in use among the 1.6 billion mobile devices sold in 2010. Sensing data, call detail records, location based information, digital media, and other sources will lead the individual data explosion.
Coping with explosion in transactional data volumes. A collusion of compliance, regulatory, and digitalization leads to exponential increase in transactional data. Audit and compliance requirements lead to increase in security log files, network and system event logs, emails, and searchable messaging communciations. Add significant automation of business processes and Constellation Research estimates that annual growth in online transactional data and repositories will grow 66%. Most data centers now commit 25% of their infrastructure spend to support storage for data growth.

Informatica 9.1 Focuses On Big Data

Announced June 6th, 2011, Informatica 9.1 is generally available. The new release focuses on four key themes that address the Big Data issue:

Delivering a near open data integration platform. The new release supports Hadoop, big transactional data, and big interaction data. Hadoop support includes connectivity to the file system, HDFS and MapReduce for big data processing. Big transactional data features support EMC Greenplum and other DW appliance vendors soon, in addition to existing Oracle, IBM DB2, IBM Netezza and Teradata connectivity. Big interaction data connectivity support for the Big 3: Facebook, Twitter and LinkedIn.

Point of View (POV): Hadoop provides low cost processing and storage platforms required to address the big data issue. While Informatica 9.1 is designed for a mind boggling petabyte connectivity to OLAP and OLTP data stores today, power users will push for exabyte scale in 12 to 18 months. The new release also delivers a complementary relational/data warehouse appliance package. For social data, organizations will improve their ability to correlate social media signals with transactional data to deliver new insights across the organization. Expect Hadoop and social media connectors to be delivered later in June 2011.
Incorporating master data management technologies with Big Data. The new release incorporates key assets from the Siperian Master Data Management (MDM) acquisition. Users gain new multi-style and multi-domain MDM approaches. Data governance is addressed via resusable data quality policies while proactive data quality builds on Informatica's complex event processing technology to identify and alert users on data quality exceptions.

POV: Informatica's MDM offering remains among the top in shortlists at Constellation Research. The solution delivers true multi-style, multi-domain, multi-deployment, and multi-use capabilities on one technology platform. Users gain the ability to manage data quality rules in source applications that not only propagate downstream, but also take advantage of complex event processing (CEP) to provide proactive alerting (see Figure 1). Informatica's Rule Point CEP engine also provides key geo-aware processing capabilities for advanced scenarios.

Figure 1. Informatica's Self Service Proactive Monitoring

Source: (Informatica)

Self-service empower all users to obtain relevant information while IT remains in control. Self service takes a role based design that addresses the needs of business users, project owners, IT analysts, developers, and data stewards. Users gain more capabilities to access data controls, define rules, and adjust parameters. Project owners can source business entities within applications without having to understand schemas and data models.

POV: Informatica's design point traditionally focuses on power users. The shift to more self-service capabilities provides a good start to addressing the needs of tech savvy business users and reducing the overall cost of ownership.

Adaptive data services provide critical information governance capabilities. Centrally managed processes include multi-protocol data provisioning for data virtualization, integrated data quality for data governance, and policy-driven enforcement for data governance. Organizations will gain the ability to quickly provision data services through ODBC or JDBC, as a web service, or to PowerCenter. Read and write data quality, data quality templates and data quality rules are delivered out of the box.

POV: The information governance problem mirrors the data deluge problem. Disparate tools and disparate governance processes afflict every enterprise. Adaptive data services bring order to a chaotic array of protocols and policies.

The Bottom Line: Organizations Can Apply Solutions Such As Informatica 9.1 To Master The Information Supply Chain
The technology solutions to address complex information management processes often reside in a disparate collection of best of breed apps. With the big data issue looming, organizations must move beyond a technology decision. More than just dealing with an explosion of data, organizations must support business processes that rely on the information supply chain. Eight key areas of the information supply chain include:

Source: Insider Associates, LLC

Classify. Classification schemes tie relationships back to structured and unstructured data. Classifications could include subjects, location, individuals, organizations, relationships, and other metadata.
Transform. Data from multiple source systems require transformation into a compatible format for the destination system. Techniques include translating coded values, sorting, joining, transposing, splitting, disaggregation, etc. Maintaining source system lineage enables the ability to revert or undo changes.
Augment. Augmentation enables users to provide additional information to the data. Examples include third party data from commercial sources, government agencies, and market research firms.
Secure. Data privacy and security should map back to existing policies. In some cases, data should be masked or encrypted. Simple security could include classifications for public, private, and restricted. Most systems will map back to role based security systems.
Deliver. Delivery should include both automated and manual techniques. Subscribing systems could trigger requests based on rules and policies in complex event processing engines or simple thresholds. Manual delivery mechanisms should log back to interaction engines.
Refresh. The half life of cleansed data can range as little as 10 seconds for location based status updates to 3 months for addresses for a transient college student. Information supply chains must continually refresh information to stay relevant.
Archive. Unused data can drive down performance times, create regulatory compliance nightmares, and expose legal risks. By moving inactive data from production systems to backups, organizations can leave important and necessary data indexed and accessible without stymying existing systems.
Retire. Organizations can rid themselves of older data that is no longer required for compliance or is irrelevant. Data can be encrypted and stored offsite or even hard erased.

Organizations that address the challenges of big data will gain significant strategic advantages in better analytical insight, right time engagement, and scalable operational efficiencies.
Your POV.
Will you make the move to address Big Data with Informatica 9.1? Will you consider other options? What will drive you to go with one platform? Add your comments to the blog or send us a comment at r (at) softwareinsider (dot) org or info (at) ConstellationRG (dot) com.
Let us know how we can assist with:

Building a Cloud Strategy
Designing your apps strategy
Crafting a social business strategy
Short list vendors
Negotiate your software contract

Related Links
20110607 Information Week - Doug Henschen "Big Data: Informatica Tackles The High-Velocity Problem"
20110605 IDG News Service – Chris Kanaracus “Informatica Adds Support for 'big Data,' Hadoop”

Related Resources
2011o509 Monday’s Musings: Using MDM To Build A Complete Customer View In A Social Era
20090831 Monday’s Musings: Why Every Social CRM Initiative Needs An MDM Backbone
20110102 Research Summary: Software Insider’s Top 25 Posts For 2010
20101216 Best Practices: Five Simple Rules For Social Business
20110104 Research Report: Constellation’s Research Outlook For 2011
20101004 Research Report: How The Five Pillars Of Consumer Tech Influence Enterprise Innovation
Reprints
Reprints can be purchased through Constellation Research, Inc. To request official reprints in PDF format, please contact [email protected].
Disclosure
Although we work closely with many mega software vendors, we want you to trust us. For the full disclosure policy, stay tuned for the full client list on the Constellation Research website.
Copyright © 2011 R Wang and Insider Associates, LLC All rights reserved.

Cupertino Ray Wang Ray R Wang analytics Apps Strategy Australia business intelligence Business Objects Cloud cloud computing Contract Negotiations contract strategy enterprise applications enterprise apps Enterprise apps strategy Enterprise Business Apps Enterprise Software ERP event report Jeff Word SaaS SaaS strategies SAP SAP Australian User Group SAUG SAUGSummit Software as a Service Sydney upgrade upgrades user conference user event user group event user group events user groups users Apache Software Foundation Attenda Capgemini Capgemini Immediate Cloud BPO cloud integration Cloud options Cordys Demandware Drupal Eloqua Google IBM Infosphere Datastage Kognitio Omniture R "Ray" Wang; Royal Mail Group rwang0 SalesForce.com SoftwareInsider Talis Tech Ecosystem acquisition acquisitions alliances business process outsourcing Cisco Cisco Systems collaboration software custom apps database dell Enterprise Business Apps Vendors enterprise strategy escrows Featured financing options hardware Hewlett Packard hp IBM implementation partners information management IT budgets IT Strategy last mile solutions managed service provider mergers mergers and acquisitions Microsoft middleware middleware platforms Monday's Musings Next Gen Apps next gen enterprise next generation Next generation apps On Demand on-premise operating systems Oracle packaged apps partner ecosystems partners partnerships procurement Research Report resellers Software software escrow Software Insider storage System Integrators technology budgets third party financing trusted advisors vars vendor strategy vertical apps 3PM BearingPoint BPO Hosted legacy hub and spoke Legacy containment Mid-term replacement PaaS extensions Point solutions Private clouds software bill of rights Surround strategies Thrid party maintenance Tuesday's Tip two-tier ERP 2010 Amdocs Ariba Blackboard CA Technologies CDC Software Computer Associates Concur Deltek Exact Software IFS JDA Software Lawson Manhattan Associates NetSuite on-premises Q2 QAD Quarterly Financial Tracker RightNow SII Software Insider Index SuccessFactors Taleo Ultimate Software Customer Centric Cloud Agreements Epicor Lite FinancialForce Insider Insights™ Intacct Microsoft Dynamics CRM OnDemand Oracle Siebel OnDemand Plex Systems Polls Polls and Surveys RightNow Technologies SAP ByD Software Insider Insights™ software licensing software licesing and pricing software ownership software ownership lifecycle software pricing software vendors Surveys WorkDay Alan Webber Altimeter Group Charlene Li David Stanley Deborah Schultz Jeremiah Owyang Lora Cecere Marcia Conner Michael Gartenberg Personal Log Activity streams application development architecture Data deluge emerging technologies Enterprise 2.0 enterprise architecture enterprise collaboration Facebook next gen Point of view product road maps social business software social enterprise social enterprise apps social technologies software trends usability user experience user interaction Web 2.0 business technology Corporate Vision Corporate Vision And Strategy Customer References Ecosystem Feedback Market Execution Ownership Experience Regulatory Requirements saas integration SaaS offensive Social Business Solution Offering user strategy Vendor Selection 2011 B2B E-commerce B2C E-commerce billing crm; customer relationship management customer relationship management (CRM) deployment options e-commerce E-commerce survey eCommerce Integrated support Interactive product catalogs Multi-channel selling NextGen order capture to order fulfillment order completion to cash order fulfillment ot order completion order management order management cycle order to cash Perfect Order Social CRM social media Stakeholder driven Amazon Attensity Baidu business analytics Citrix Classmates.com Consumer Tech Craigstlist.org eBay Four S's of Enterprise Class Acceptance IDC Information Week Information Week 500 innovation Internet of Things Jive LinkedIn Lithium Meebo mobile next gen cio next gen CIO’s next gen IT leaders NexTag Orkut PlexSystems Priceline.com Rackspace safe scalable SCRM secure Shopping.com Shopzilla Skype Streetline Networks sustainable Target.com Twitter Unified Communications Video Walmart.com Yahoo! YouTube Moscone oow10 Oracle Open World san francisco softwwareinsider vendor events Cloud Wars HCM oracle exalogic elastic cloud Oracle Fusion Apps Oracle Partner Network Oracle-Sun PaaS Market2Lead Marketo Silvepop Software Insider Tech Ecosystem Model™ Unica Vtrenz appliance market Kleiner Perkins Léo Apotheker News Analysis Ray Lane Government Contacting Input project based solutions project management public sector Android Java sun Sun Microsystems Charles Phillips Infor Infor Global Solutions GmbH IPO Jim Schaper Microsoft Azure Microsoft Windows Phone 7 #crsch Constellation Constellation Research CRCH CRG CA Epicor IFS North America informatica Kenexa Pervasive Software Progress Software Saba SoftwareAG subscription revenues VMWare lawsuit tomorrow now B2B B2B market strategy B2C best practices bill of rights brand monitoring Business 2.0 business drivers business process optimization business requirements business strategy business value CEO cfo CIO Cloud Security CMO Convergence cost reduction CTO custom development DaaS data integration data quality data stewardship disruptive disruptive technologies disruptive technology early adoptions Enterprise Software Licensee Bill of Rights future of business Gov 2.0 gov20 Government Contracting govtech hybrid hybrid deployments information management matrix innovation insights IT services firms Legacy Optimization lessons learned license fees license management license parking license policy license returns line of business Maintenance maintenance fees management strategy market strategy marketing P2P pace of change pace of technology adoption Private Cloud relationship managmeent relationships SaaS Bill of Rights SaaS escrow sales strategies service economy services based industries social campaign tracking social customer insights social event management social media monitoring social service social support social support insights software appliances software contract reviews Software Ecosystems software maintenance software revenue recognition rules sourcing stack wars subscription pricing technology adoption technology partnerships technology platforms third party maintenancce Third Party Maintenance Trends use cases used software value vmforce df10 Heroku Marc Benioff platform as a service back maintenance fees shelfware collaboration customer experience management socbiz social marketing insights IaaS integration software version Chirag Mehta information broker Epicor 9 license credits to new products master data management MDM Microsoft Azure Services Platform Microsoft Business Solutions Microsoft Dynamics MIcrosoft Dynamics ERP Microsoft Dynamics GP Research Summary Scout Labs thanks #CRInc 3PL auto ID Canada Constellation Research Inc. ConstellationRG Crosslink Distribution Hudson's Bay Jeff Ashcroft LeADS logistics PWC retail RFID s&op Semantic Web Social CMO social supply chain Strategic Logistic Partners supply chain Supply Chain Network Project third party logisticis Tibbet & Britten Toronto Bloom & Wallace Board of Advisors HR hr technology human resources Naomi Bloom Naomi Lee Bloom ACMA AT&T Navigation Services Australian Communications and Media Authority BI BrightKite Cellcom CTIA EchoEcho EFF Electronic Frontier Foundation Electronic Privacy Information Center Foursquare Foursqure Global Navigator and Maps Google Maps Google Navigation Google. Latitude Gowalla LBS LOC-AID Location Based Services Loopt Mark Zuckerberg OnStar Phantom Alert Plazes PleaseRobMe predictive analytics Rummble Sam Altman smartphones social social analytics Sprint Family Locator Trapster Tripit web analytics WHERE Yahoo! Maps Yelp Yowza!! Zhiing Adrian Bowles and compliance Atelier Research Boston College Clean Tech Corporate Social Responsibility CSR Drexel University Energy and carbon management Giga Information Group GRC GTE Inc. New Science Associates NYU Object Management Group Press Release SIG411 SUNY-Binghamton Sustainability Yourdon #archat Academics analyst relations analyst strategy authors event producers iiar industry analysts influencer relations management consultants media moguls peer to peer faciliators training gurus Gamification gsummit Zynga Kieran Barr Discovery early adopters Evangelization Experimentation Formalization maturity models Peer to Peer Realization Social Business Maturity Social Business Maturity Models social business strategist social commerce State of Social Business Amy Wilson emergin technologies human capital management PeopleSoft social recruiting @buchanla @grahamhill design thinking EMEA Graham Hill Laurence Buchanan MIcrosoft Dynamics CRM Trip Report Wipro Debra Lilley Fujitusu Oracle User Group UKOUG America's SAP User Group ASUG Bridgette Chambers Accenture Deloitte Deloitte Consulting disruptive tech ecosystem HCL Technologies India Infosys Mahindra Satyam Nasscom strategic advisor Tata Consulting Services TCS thought leadership Apple Art Levinson Carol Bartz Eric Schmidt Genentech Jack Bienko John Chambers John Doerr John Hennessy Larry Ellision Larry Ellison Netflix President Obama R "Ray" Wang Monday’s Musings Reed Hastings SBA Silicon Valley Stanford University Startup America Steve Jobs Steve Westly Tech Halo Westly Group envy gluttony greed lust pride sloth wrath Business by Design Business Suite 7 ByD CeBit John Wookey sales force automation SAP Business By Design SAP Business Suite 7 SAP OnDemand Large Enterprise agresso Ensw Glovia Intit JD Edwards Lawson Software Sage Group Syspro Unit4 Five Elements Of Enterprise Class Five Phases Of Enterprise Acceptance Tags Comments Date Analytics Title Author Categories Tags Comments Date Analytics Monday’s Musings: The Race For Enteprise Class Consumer Tech - Draft Edit | Quick Edit | Trash | Preview Monday's Musi Cambridge Technology Partners customer experience CXP Disney Guthy Renker Carl Icahn Chuck Phillips George Soros Harry Debes Lawson M3 Lawson S3 Romesh Wadhwani business process Conflict of Interest Corruption IT Services sourcing advisor worst practices Activity activity feeds enterprise class Radian6 socialytics $GOOG $MSFT $YHOO and BuyWithMe Bloomspot Groupon LivingSocial OpenTable Scoutmob Tippr Microsoft BPOS Microsoft Dynamics AX Microsoft Dynamics Convergence Microsoft Dynamics NAV Microsoft Dynamics SL Microsoft Office Microsoft Partners Intuit SAGE software mainteance Sunguard vendor management Amazon EC2 Amazon Web Services Amazon.com backup disaster recovery high availability HootSuite IBM Blue Cloud public cloud Quora Reddit customer data integration #daac11 #ssve boomi Compellent EqualLogic Exanet KACE Networks Message One Ocarina Networks Perot Systems SecureWorks security The Networked Storage Company customer engagement Customer Hubs data governance Informatica Customer Data Forum information supply chain Adobe application life cycle management ASUG365 CA/Hyperformix EMC ERP-Link Flexera Software Hayes Technology Group HiLn Solutions IBM Alloy IBM DB2 IBM DB2 For SAP IBM Infosphere Optim Information Builders IntelliCorp iWay Microsoft Duet Microsoft SQL Server Microsoft SQL Server for SAP Panaya Revelation Software Concepts rimini street SAPPHIRE SapphireNow Susen Software Synactive Tibco TIBCO ActiveSpaces Data Grid TIBCO Hawk Tidal Software Used Soft virtualization West Trax Winshuttle social sales insights Big Data Engagement Apps Mobile Enterprise Aaron Pearson Alan Silberberg Alex Willaims AMP Annalie Killian ASUG News Augemented Reality Awards BallouPR Barney Beal Bob Egan CBS News Chris Kanaracus CloudAve Colette Ballou ComputerWorld UK Constellation SuperNova Awards Courtney Bjorlin CRM Magazine David Brousell David Myron Douglas Henschen Erin Kinikin Esteban Kolsky Frank Scavo IDG News Service Info Today Jason Maynard John Furrier Kash Rangan Kewal Varia Krishnan Subramaninan KrishWorld Larry Dignan Managing Automation Maribel Loepz Marshall Kirkpatrick Marshall Lager Merrill Lynch Mike Simons Paul Greenberg Paul Papadimitriou ReadWriteCloud ReadWriteEnterprise ReadWriteWeb Robert Scoble Sepharim Group Spark Communications SuperNova Awards Susan Thomas Tech Target The 56 Group LLC theMIX Agency ThinkJar Third Idea Consulting LLC Thomas PUblishing Thomas Wailgum Trainer Communications Vanessa Camones Weber Shandwick Wells Fargo Securities ZD Net Executive Profiles Get Satisfaction Peter Lorenz SAP AG Thursday's Tech Showcase Aaron Levie Adam Rogers Adobe Systems Alan F. Nugent Alcatel-Lucent Alistair Rennie Aneel Bhusri Apprenda Attensity Group Bill Jacarsuo Bob Kelly Box.net Brad Smith BunchBall Charlie Isaacs Clarabridge Danile Debow David Bankston David Sacks Ed Van Siclen Eugene Lee Gaurav Dhillon Greg Gianforte Ian Hersey INgage Networks J.B. Holston Jive Software Joe Fernandez Klout Lithium Technologies Loic Le Meur Lyle Fong Marcel LeBrun Mark Symonds Michael Ni Moxie Software Mzinga NewsGator Parker Harris Rajat Paharia Ram Menon Randy Guard Rick Nucci Rob Howard Rob Tarkoff Ryan Holmes Rypple SAS Institute Seesmic Sid Banerjee Sinclair Schuller SnapLogic SocialText Telligent TIBCO Software Tien Tzuo Tom Kelly Tony Zingale Verafirma Wendy Lea Yammer Zach Nelson Zuora Hadoop Map Reduce Product Review