History Server Component State

Microsoft.HDInsight.UnitMonitor.HistoryServerYarnComponentHealthState (UnitMonitor)

Monitors the health state of History Server process.

Knowledge Base article:

Summary

This monitor checks the health state of History Server process on the host. History Server is responsible for keeping track of executed jobs that are not handled by jobtracker anymore (jobs marked as "retired" by JobTracker). When History Server is not running cluster administrator is not able access details on executed jobs using "Job Tracker History" link from Map Reduce Web UI.

HDInsight Appliance

Monitor is active and reports actual component state.

HDInsight Azure

This monitor is not available in HDInsight clusters on Azure, so diagnostic and resolution steps below do not apply to this type of environment.

Causes

History Server service may be offline due to various reasons:

Resolutions

If History Server is not stopped on purpose, use the following steps to diagnose the issue:

Connecting remotely to the head node is a two-step operation:

To resolve the issue:

Element properties:

TargetMicrosoft.HDInsight.HostComponent.HistoryServerYarn
Parent MonitorSystem.Health.AvailabilityState
CategoryAvailabilityHealth
EnabledTrue
Alert GenerateTrue
Alert SeverityMatchMonitorHealth
Alert PriorityNormal
Alert Auto ResolveTrue
Monitor TypeMicrosoft.HDInsight.UnitMonitorType.HostComponentHealthState
RemotableTrue
AccessibilityPublic
Alert Message
History Server is not running.
History Server hosted on "{0}" in the cluster "{1}" reports "{2}" state.
RunAsDefault

Source Code:

<UnitMonitor ID="Microsoft.HDInsight.UnitMonitor.HistoryServerYarnComponentHealthState" TypeID="Microsoft.HDInsight.UnitMonitorType.HostComponentHealthState" Target="Microsoft.HDInsight.HostComponent.HistoryServerYarn" ParentMonitorID="Health!System.Health.AvailabilityState" Remotable="true" Priority="Normal" Accessibility="Public" Enabled="true" ConfirmDelivery="true">
<Category>AvailabilityHealth</Category>
<AlertSettings AlertMessage="Microsoft.HDInsight.UnitMonitor.HistoryServerComponentHealthState.AlertMessage">
<AlertOnState>Warning</AlertOnState>
<AutoResolve>true</AutoResolve>
<AlertPriority>Normal</AlertPriority>
<AlertSeverity>MatchMonitorHealth</AlertSeverity>
<AlertParameters>
<AlertParameter1>$Target/Host/Host/Property[Type="Microsoft.HDInsight.Host.Private"]/HostName$</AlertParameter1>
<AlertParameter2>$Target/Host/Host/Property[Type="Microsoft.HDInsight.Host.Private"]/ClusterName$</AlertParameter2>
<AlertParameter3>$Data/Context/Property[@Name='$Target/Property[Type="Microsoft.HDInsight.HostComponent"]/ComponentName$']$</AlertParameter3>
</AlertParameters>
</AlertSettings>
<OperationalStates>
<OperationalState ID="Healthy" MonitorTypeStateID="Healthy" HealthState="Success"/>
<OperationalState ID="Unhealthy" MonitorTypeStateID="Unhealthy" HealthState="Warning"/>
</OperationalStates>
<Configuration>
<IntervalSeconds>900</IntervalSeconds>
<TimeoutSeconds>300</TimeoutSeconds>
</Configuration>
</UnitMonitor>