1 User Manual-1909 NMC Intelligent Monitoring 用友网络科技股份有限公司 2 Copyright © yonyou Network Technology Co., Ltd. All rights reserved Without the writte its content does not represent the commitment made by yonyou Group. n permission of yonyou Group, any part or the whole content of this manual may not be reproduced, copied, translated or reduced for any purpose. Please note that the content of this manual may be changed without prior notice, and 用友网络科技股份有限公司 3 Table of Contents Copyright ....................................................................................................................................2 Table of Contents .......................................................................................................................3 Part I: Start Using NMC .............................................................................................................5 Chapter I Chapter I: Overview .................................................................................... 5 Chapter II Deployment and Usage ............................................................................. 10 Part II: Real-time Monitoring ...................................................................................................17 Overview ............................................................................................................................. 17 Chapter III OverView .................................................................................................. 18 Chapter IV Process Monitoring ................................................................................... 19 Chapter V Thread Monitoring .................................................................................... 28 Chapter VI Database Monitoring ................................................................................. 35 Chapter VII Client Monitoring ...................................................................................... 39 Chapter VIII System Monitoring .................................................................................... 44 Chapter IX Best Practices ............................................................................................ 45 Part III: Off-line Analysis.........................................................................................................47 Overview ............................................................................................................................. 47 Chapter X Message Center ......................................................................................... 48 Chapter XI Log Extract ................................................................................................ 60 Chapter XII Log Recording .......................................................................................... 66 Chapter XIII Log Analysis ............................................................................................. 69 Chapter XIV Global Performance Report ...................................................................... 73 Chapter XV Business Data Pattern Report .................................................................... 79 Chapter XVI AWR ......................................................................................................... 81 Chapter XVII Best Practices .................................................................................... 83 Part IV: Cluster Management ...................................................................................................86 Overview ............................................................................................................................. 86 Chapter XVIII Security Settings ................................................................................ 86 Chapter XIX Process Management ................................................................................ 87 用友网络科技股份有限公司 4 Chapter XX Parameter Settings .................................................................................... 88 Chapter XXI Patch Management 5 ................................................................................. 89 Chapter XXII Patch Management 6 ......................................................................... 97 Chapter XXIII Best Practices .................................................................................. 102 Part V: Other ..........................................................................................................................104 Overview ........................................................................................................................... 104 Chapter XXIV Network Testing .............................................................................. 104 Chapter XXV Client Performance Optimization .................................................... 109 Chapter XXVI SPR .................................................................................................. 111 用友网络科技股份有限公司 5 Part I: Start Using NMC Chapter I Chapter I: Overview Server clustering and load balancing are the only way to ensure expanding applications and stability and efficiency. Starting cluster sites can be managed manually when they are digitally. However, when the site becomes dozens or hundreds, it will undoubtedly become a disaster to rely on manual management and maintenance, so an automatic, intelligent and visual active monitoring system-NMC came into being. NMC is an advanced platform for performance monitoring, performance analysis and system management. NMC will greatly improve the efficiency of performance monitoring, performance analysis and system management, thus obtaining better service quality and customer satisfaction. For large-scale and complex IT systems, NMC are indispensable. In terms of monitoring, it integrates advanced performance monitoring ideas, such as distributed monitoring services, multi-layer penetration, information collection and analysis separation, event-driven snapshot mechanism, etc. The monitored objects cover all aspects of application system performance concerns, including databases, middleware, clients, etc. In terms of performance analysis, it has powerful log extraction and analysis capabilities, and provides rich chart display functions. It can extract and analyze logs of application systems, JVM, threads, snapshots, CPU, memory, clients and online users, and it also presets a large number of analysis models. In terms of system management, it adopts the idea of active discovery and active management, which can actively manage when middleware fails, thus improving system reliability. Integrated patch management tools can help administrators quickly deploy and roll back patch processes, thus greatly improving the efficiency of patch management in cluster environment. Terms and Definitions Monitoring Domain: A collection of a service center and a set of monitoring instances managed by the service center. These monitoring instances can be distributed across multiple physical servers. The monitoring instance consists of a set 用友网络科技股份有限公司 6 of monitoring services. In a monitoring domain, all monitoring services managed by monitoring instances register their monitoring resources with the service center through the service manager, and the monitoring service resources are uniformly released by the service center. Service Center: The core management process of a monitoring domain is used to register all service resources in the monitoring domain and provide service startup control, access log, service request routing, security check and dynamic parameter refresh functions. There is only one service center in a monitoring domain, and all monitoring instances must set up service center servers. The monitoring client only needs to connect to the service center server to access the service resources of the entire monitoring domain. The relationship between monitoring domain and service center is shown in Figure 1-1: 监控域服务中心服务器监控实例注册NMC客户端资源发布调用调用 Figure 1-1 Relationship between Monitoring Domain and Service Center Message: Warnings generated by the monitoring service. Messages can drive the generation of snapshots. For example, thread monitoring found that a task took more than 3 minutes to execute and a message was generated. The message drives the generation of snapshots of middleware and databases. Snapshot: Detailed running status information generated when the system is running. This information is used to help locate the problem in detail. Such as thread detail stack and SQL stack of middleware. Execution plan of the database, etc. Snapshot message binding. 用友网络科技股份有限公司 7 Technical architecture 监控实例监控管理服务中心服务资源中心连接适配Email|Http(s)|JMX服务定位器连接适配器Email|Http|JMX服务管理器服务中心管理器访问调用使用注册调用监控客户端(UI Plusgins)监控服务池 Figure 1-2 NMC Technical Architecture NMC client is an integrated monitoring UI and a plug-in-based UI framework. The client communicates with the service center through a variety of protocols, including HTTP(s), Email, JMS; After the service center receives the client request, the service center manager locates the service through the service locator and routes the requested service resources to the corresponding monitoring process; The monitoring process transfers the request to the service manager, through which the request is transferred to a specific service instance for processing; There can be one or more monitoring instances in a monitoring domain. 用友网络科技股份有限公司 8 Relationships between Monitoring Services 服务实例池主要服务消息中心监控日志进程监控线程监控数据库监控CPU补丁管理内存日志分析缓存监控IO客户端监控穿透穿透穿透穿透生成生成生成生成生成生成生成分析生成穿透 Figure 1-3 Relationships between Monitoring Services One of NMC's important advantages is that services can penetrate each other. For example, the penetration of process to thread and then to database information, and the penetration of client to thread information. In a multi-level architecture system, the penetration of monitoring information between layers can help you analyze problems more systematically and accurately. Monitoring services will generate logs and messages, which can further become data sources for log analysis. Core Features Integrated platform for performance monitoring, problem analysis and system management; Provide HTTP, Email, OFFLine and other connection or usage modes; Support multiple monitoring domains. Monitoring domain supports multiple servers and various types of monitoring services. Provide seamless and comprehensive monitoring system from client to middleware and database; Support the mutual penetration of middleware layer and database layer information; Support the mutual penetration of client logs and server logs; Provide powerful middleware process-level monitoring and management functions; Provide fine-grained thread monitoring function; 用友网络科技股份有限公司 9 Provides powerful and flexible log extraction and analysis functions; Provide an event-driven function of active monitoring and active management; Integrate patch management functions; Realize the isolation between the security of monitoring service and the security of monitoring target. Benefits A high degree of insight into the system can be obtained. Using the real-time monitoring of NMC can obtain real-time control of the running state of the system, whether at the level of system summary or fine-grained thread, it can be seen in a glance. Analysis of logs and messages can evaluate the overall operation status of the system and predict risks. Discover and locate problems quickly. Positioning performance problems can be quickly found through real-time monitoring or offline analysis. Compared with traditional monitoring tools, NMC can help you locate performance problems more quickly. Improve system availability. Active management and automatic fault recover...