How IBM Monitors Its Sametime Environment — An Update

Bret Swedeen, System Tools Developer, IBM Global Services


November, 2009


Discover how IBM monitors the health of one of the largest implementations of IBM Lotus Sametime in the world, with over 200,000 concurrent Sametime connections across multiple servers during peak usage. Sametime admins and the developers who support them gain insight into readily available monitoring utilities and how to display them in a dashboard. The dashboard provides instant, real-time data on usage and servers for monitoring the health of the current environment. Also see how IBM analyzes when to add resources, the best times to schedule upgrades or changes, and more.


Since I wrote this article in 2007, we’ve started to monitor individual Sametime processes at IBM using Simple Network Management Protocol (SNMP). The process is a little tricky, so I've added how-to instructions for collecting SNMP statistics to help adventurous administrators build their own collection tool. We've also added an expanded Sametime Monitoring Dashboard to our arsenal of custom tools, so I’ve included a few screen shots of this new tool to further demonstrate what is possible.
In my work for IBM on the internal deployment of IBM Lotus Sametime, I’m often asked, “How does IBM monitor its Sametime environment?” Some people expect an answer that reveals a secret, non-intuitive approach to monitoring, and they are surprised by the simple approach IBM takes toward this task. In short, we IBM Sametime administrators use a combination of real-time monitoring and trend analysis of historical data to completely understand our Sametime environment. It doesn’t get any more tricky or complex than that. Yes, we created some collection routines and a display tool to aid our two-pronged monitoring approach; however, the mechanics are straightforward.
In this article, I showcase IBM’s monitoring approach to explain the readily available mechanisms by which we collect both current Sametime usage and server data, and how we make the information available to admins in a unified dashboard plug-in, also known as a miniApp. Admins check this dashboard to see whether trouble is brewing. The environment also delivers historical data for trend analysis in support of assessing infrastructure needs and planning any upgrades or maintenance. I also discuss how we perform additional calculations and then store all data in a Rational database.
To put everything into context, I’ll start with a brief explanation of the overall Sametime server monitoring architecture.

Would you like to see the full version of this article?

If you are an electronic license holder to THE VIEW, please log in to view this article.

If you would like information about becoming an electronic license holder — and having 24/7 unrestricted access to all articles and content in THE VIEW Online Knowledgebase — click here to see the available subscription options.

Or call 1-781-751-8813 to speak directly with a subscription and licensing specialist about customized access for you and your team.