Data center management
Data center management is the collection of tasks performed by those responsible for managing ongoing operation of a data center This includes Business service management and planning for the future.
Historically, data center management was seen as something performed by employees, with the help of tools collectively called Data Center Infrastructure Management (DCIM) tools. Now an outsourcing option exists: Data-center Management As A Service - DMaaS.
- 1 Coopetition
- 2 Focus
- 3 Newer developments
- 4 Data center asset management
- 5 Data center infrastructure management
- 6 Operations
- 7 Tech Support
- 8 Preventive maintenance
- 9 Managing the capacity of a data center
- 10 Top data centers and service providers
- 11 See also
- 12 References
- 13 External links
Data center management is a growing major topic for a growing list of large companies who both compete and cooperate:
Hardware/software vendors who are willing to live with coopetition are working on projects such as "The Distributed Management Task Force" (DMTF) with a goal of learning to "more effectively manage mixed Linux, Windows and cloud environments."
With the DMTF a decade old, the list of companies is growing, and also includes companies much smaller than IBM, Microsoft, et al.
Among the topics currently being explored are:
- Securing Data Center Networks
- Disaster Recovery
- Government restrictions
- Estimated cost of downtime regarding:
Business service management
IBM notes that major problems often happen in the grey areas, particularly due to errors in the interfaces, and focuses on critical failures. Sufficient redundancy should allow failures in non-critical areas to protect the business from being affected.
- promotes a customer-centric and business-focused approach to service management, aligning business objectives with IT or ICT from strategy through to operations
- is positioned above IT Service Management (ITSM)
Tools that help BSM include:
- A modeling language
- A common dashboard, allowing the data center to see problems before business customers do.
Remote Data Center Management allows offsite experts to watch for situations needing their timely intervention at a lower cost than having such staff be onsite 24/7/365.
Data center asset management
Data center asset management (also referred to as inventory management) is the set of business practices that join financial, contractual and inventory functions to support life cycle management and strategic decision making for the IT environment. Assets include all elements of software and hardware that are found in the business environment.
Hardware Asset Management
Hardware asset management entails the management of the physical components of computers and computer networks, from acquisition through disposal. Common business practices include request and approval process, procurement management, life cycle management, redeployment and disposal management. A key component is capturing the financial information about the hardware life cycle which aids the organization in making business decisions based on meaningful and measurable financial objectives.
IT asset management generally uses automation, to manage the discovery of assets, so inventory can be compared to license entitlements. Full business management of IT assets requires a repository of multiple types of information about the asset, as well as integration with other systems such as supply chain, help desk, procurement and HR systems and ITSM.
Data center infrastructure management
Data center infrastructure management (DCIM) is the integration of information technology (IT) and facility management disciplines to centralize monitoring, management and intelligent capacity planning of a data center's critical systems. Achieved through the implementation of specialized software, hardware and sensors, DCIM enables common, real-time monitoring and management platform for all interdependent systems across IT and facility infrastructures.
DCIM products can help data center managers identify and eliminate sources of risk and improve availability of critical IT systems. They can also be used to identify interdependencies between facility and IT infrastructures to alert the facility manager to gaps in system redundancy, and provide dynamic, holistic benchmarks on power consumption and efficiency to measure the effectiveness of "green IT" initiatives.
It's important to measure and understand data center metrics, including those regarding energy efficiency and utilization of servers, storage, and staff. In too many cases, disc capacity is vastly under-utilized and servers run at 20% utilization or less. More effective automation tools can also improve the number of servers or virtual machines that a single admin can handle.
DCIM providers are increasingly linking with computational fluid dynamics providers to predict complex airflow patterns in the data center. The CFD component is necessary to quantify the impact of planned future changes on cooling resilience, capacity and efficiency.
Information technology operations, or IT operations, are the set of all processes and services that are both provisioned by an IT staff to their internal or external clients and used by themselves, to run themselves as a business. The term refers to the application of operations management to a business's technology needs.
As lights out operations increased, less of the staff are located near corporate headquarters. Gartner defines IT operations as "the people and management processes associated with IT service management to deliver the right set of services at the right quality and at competitive costs for customers."
- Tier 1: Basic help desk - initial point of contact, including software opening a trouble ticket. Information available to its personnel include FAQ and a basic knowledge base.
- Tier 2: In-depth technical support
- Tier 3: Expert product and service support.
The extra tiers are:
- Tier 0: Self help (i.e. by the end user)
- Tier 4: Outside support for "items not directly serviced by the organization"
Access to varying levels of support for products and services to in-house employees and corporate customers, providing information and troubleshooting is via various channels such as toll-free numbers, websites, instant messaging, or email.
Help desk professionalism
As the incoming phone calls are random in nature, help desk agent schedules are often maintained using an Erlang C calculation.
Companies with custom application software may also have an applications team who are responsible for the development of in-house software. The help desk may assign to the applications team such problems as finding software bugs. Requests for new features or information about the capabilities of in-house software that come through the help desk are also assigned to applications groups.
The help desk staff and supporting IT staff may not all work from the same location. With remote access applications, technicians are able to solve many help desk issues from another work location or their home office. While there is still a need for on-site support to effectively collaborate on some issues, remote support provides greater flexibility.
Some companies and organizations provide discussion boards for users of their products to interact; such forums allow companies to reduce their support costs without losing the benefit of customer feedback.
Outsourcing technical support
Many organizations relocated their technical support departments or call centers to countries or regions with lower costs. Dell was amongst the first companies to outsource their technical support and customer service departments to India in 2001, but then reshored. There has also been a growth in companies specializing in providing technical support to other organizations. These are often referred to as MSPs (Managed Service Providers).
For businesses needing to provide technical support, outsourcing allows them to maintain a high availability of service. Such need may result from peaks in call volumes during the day, periods of high activity due to introduction of new products or maintenance service packs, or the requirement to provide customers with a high level of service at a low cost to the business. For businesses needing technical support assets, outsourcing enables their core employees to focus more on their work in order to maintain productivity. It also enables them to utilize specialized personnel whose technical knowledge base and experience may exceed the scope of the business, thus providing a higher level of technical support to their employees.
A common scam typically involves a cold caller claiming to be from a technical support department of a company like Microsoft. Such cold calls are often made from call centers based in India to users in English-speaking countries, although increasingly these scams operate within the same country. The scammer will instruct the user to download a remote desktop program and once connected, use social engineering techniques that typically involve Windows components to persuade the victim that they need to pay in order for the computer to be fixed and then proceeds to steal money from the victim's credit card.
Preventive maintenance (or preventative maintenance (PM)) is ongoing scheduled inspection intended to detect and correct incipient failures either before they occur or before they develop into major problems such as downtime.
Managing the capacity of a data center
There is a need to know what will be needed, and when. Data must continually be collected regarding usage of power/energy, computing power, data storage and networking/telecommunications. Plans must include awareness of cooling and space requirements.
Sometimes analysis of this data, and comparison to industry norms, can be outsourced. The balance for the need to focus more on data collection or analysis depends on current utilization levels: prior to 50%, the focus can stay more on data collection. Beyond 75%, the focus must shift to analysis, in preparation for upgrades, replacements and expansions. The data center is a resource in its own right.
Top data centers and service providers
According to Cloudscene's Leaderboard for Q1 2018, data center operators are ranked “based on both data center density (total operated data centers)", as well as "the number of listed service providers in the facility". Cloud service providers are ranked based on "connectivity (the total number of PoPs) for the region.” Chosen from a pool of more than 6,000 providers, the rankings are as follows:
- Q1, 2018 Top Data Center Operators Worldwide
|2||Digital Realty||Interxion||NEXTDC||Global Switch|
|3||CoreSite||Telehouse||Vocus Communications||NTT Communications|
|4||Zayo||Digital Realty||Global Switch||GPX Global Systems|
|5||Level 3 Communications||Global Switch||YourDC||ST Telemedia Global Data Centres|
|6||Cologix||Level 3 Communications||Macquarie Telecom||Netmagic Solutions|
|8||TierPoint||Colt Technology Services||Interactive||Digital Realty|
|10||QTS Realty Trust||Orange Business Services||Data Centre Limited||OneAsia Network|
- Q1, 2018 Top Service Providers Worldwide
|1||Zayo||Colt Technology Services||Telstra||Colt Technology Services|
|2||Level 3 Communications||EuNetworks||Vocus Communications||PCCW Solutions|
|3||Verizon||Cogent Communications||PIPE Networks||Tata Communications|
|4||Crown Castle||Zayo||Optus||PCCW Global|
|5||AT&T||Level 3 Communications||NextGen Group||Telstra|
|6||Cogent Communications||BT||AAPT||NTT Communications|
|9||Comcast||Orange Business Services||Zencross Connect||China Telecom|
- Application performance management
- Business process management
- Business transaction management
- Business transaction performance
- Call center
- Configuration management database
- Data center environmental control
- ISO/IEC 19770
- License manager
- Network administrator
- Service-level agreement
- Software licensing audit
- Team service management
- Comparison of issue-tracking systems
- Comparison of help desk issue tracking software
- Customer service
- Support automation
- Technical support
- Call board
- Help desk software
- "What Startups in Amazon's Ecosystem Should Learn From VMware". The New York Times.
their existing data center management ...
- "Data Center Management".
- Ann Bednarz (May 24, 2018). "Data center management - What does DMaaS deliver that DCIM doesn't". Network World.
- "The future of Data Center Management: From DCIM to DMaaS". EnterpriseTech.com. June 15, 2018.
- "What is a Service Level Agreement". Datamation. April 17, 2017.
- "Dell Makes Moves to Survive in Cloud-Centric World". NYTimes.com. August 21, 2012.
- "Google and I.B.M. Join in 'Cloud Computing' Research". The New York Times. October 8, 2007.
- "Yahoo, Intel and HP Form Cloud Computing Labs". The New York Times. July 29, 2008.
- "Coinages That Last". The New York Times. August 9, 2003.
Many buzzwords, like coopetition and thought-leading,
- "The Online Travel Landscape Is Getting Crowded". NYTimes.com. November 7, 2005.
... which Internet analysts love to call "coopetition.".
- "Meeting virtualization management challenges". The New York Times. October 27, 2008.
- The Times article mentions "a crop of next-tier vendors, start-ups and open source players."
- "Data Center Management". The Data Center Journal. Retrieved October 28, 2018.
- Matt Hancock (October 26, 2018). "Power struggle". Computer Weekly.
a row is brewing over an EU plan to curb datacentre energy use
- David Gewirtz (May 30, 2017). "The astonishing hidden and personal costs of IT downtime (and how predictive analytics might help)".
- "Flights Cancelled for more than 75,000 passengers".
- "What is business service management (BSM)?".
- Jenko Gaviglia. "Business Service Management" (PDF). IBM.com.
- A. K. Ghose. "The business service representation language" (PDF). SemanticScholar.org.
- Bednarz (June 2010). "Targeting hybrid IT environments". Computerworld.
- "Remote Data Center Management".
- Quentin Hardy (November 17, 2012). "Hard Times Could Create a Tech Boom".
- "In 2014, Proactive UPS Maintenance is Essential for all Data Center Managers" (PDF).
UPS-redundant configurations, providing backups for backups that have their own backups.
- or IT asset management (ITAM)
- "IT Asset Management (ITAM)". Gartner. May 18, 2013. Retrieved 2019-01-15.
- "Tracking All the Data: Data Center Infrastructure Management ..." ECmag.
(DCIM) software enables ... integration ...
- "Data Center Infrastructure Management - Data Center Handbook".
- "Measure and manage the risk inherent in your IT infrastructure". Network World. August 13, 2010.
- Tom Coughlin (September 9, 2018). "Green Computing And Storage". Forbes.
- "Mission: Green Computing" by Supermicro Introduces Total Cost". NYTimes.com. August 20, 2018.
- "Measuring Data Center Efficiency: Easier Said Than Done". Dell.com. Archived from the original on 2010-10-27. Retrieved 2012-06-25.
- "Computational-Fluid-Dynamic (CFD) Analysis | Gartner IT Glossary". gartner.com. Retrieved 2014-08-27.
- "Computer Operators".
- "What Do (Business, DevOps, People, Sales) Operations People Do?". theoperationsguy.com. October 23, 2012.
- Site Reliability Engineering: How Google Runs Production Systems. O'Reilly. 2016. ISBN 978-1-491-92912-4.
- "From Manhattan to Montvale". The New York Times. April 20, 1986.
- Ashlee Vance (December 8, 2008). "Dell Sees Double With Data Center in a Container". NYTimes.
- "IT Operations - Gartner IT Glossary". gartner.com. February 8, 2012.
- By Quentin Hardy (October 30, 2012). "An Update for the Corporate Help Desk". The New York Times.
- Joe Hertvik (July 7, 2016). "IT Support Levels Clearly Explained: L1, L2, L3, and More".
- "IT Help Desks Not Just For Large Enterprises".
- "Students - Information technology - Calvin College" (PDF). Calvin College. Retrieved March 23, 2018.
- "Help Desk vs Service Desk vs ITSM".
- "Technical support for the neighbours". BBC News. 2005-03-28. Retrieved 2008-03-06.
- "How to Use Online Forums". Inc.
- Dell moves outsourced jobs back to U.S. shores
- Berkley, Susan; Maggie Klenke. "Call Centre Trends". The Great Voice Company. Retrieved 2008-05-02.
- Perkins, Bart (2004-11-08). "Outsourcing: First Ask Why?". Computerworld Management. Retrieved 2008-05-06.
- Arthur, Charles (18 July 2012). "Virus phone scam being run from call centres in India". Guardian. Retrieved 31 March 2014.
- Ben Zimmer (April 18, 2010). "Wellness". The New York Times.
Complaints about preventative go back to the late 18th century ... ("Oxford English Dictionary dates preventive to 1626 and preventative to 1655) ..preventive has won"
- "What is Preventive Maintenance?". MicroMain.com.
- "What is preventive maintenance?". BusinessDictionary.com.
- Samir Mehra (September 11, 2018). "Capacity Planning in the Era of Infinite Capacity".
- "Data Center Capacity Planner Jobs, Employment". indeed.com (job search).
293 Data Center Capacity Planner jobs available on Indeed.com
- Thomas A. Limoncelli; Strata R. Chalup; Christina J. Hogan. "Room to grow: Tips for data center capacity planning". Computerworld.
- since this consumes both computing and storage resources
- J Xu; M Zhao; J Fortes; R Carpenter (2007). "On the use of fuzzy modeling in virtualized data center management". IEEE.org.
- "Cloudscene Rankings: Top Data Centers & Service Providers Worldwide". Cloudscene.
- ITOperationsAnalytics.net: What is IT Operations