Yarn Resource Manager down

YARN resource manger is down. Request you to please make it up

Thank you for letting us know. I have turned it on.

Please carry on your work.

In the meantime, I am debugging why has it gone down.

YARN resource manger is again down. Request you to please make it up

Ouch! I have restarted it.

Ok. Thank you for timely help

YARN is down now. Can you please make it up?

same here…yarn is down!!

Unable to work with hive or pig. Please find the below error

2019-10-08 14:17:42,094 [JobControl] INFO org.apache.hadoop.yarn.client.AHSProxy - Connecting to Application History server at cxln2.c.thelab-240901.internal/10.142.1.2:10200
2019-10-08 14:17:43,146 [JobControl] INFO org.apache.hadoop.ipc.Client - Retrying connect to server: cxln2.c.thelab-240901.internal/10.142.1.2:8050. Already tried 0 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=50, sleepTime=1000 MILLISECONDS)
2019-10-08 14:17:44,148 [JobControl] INFO org.apache.hadoop.ipc.Client - Retrying connect to server: cxln2.c.thelab-240901.internal/10.142.1.2:8050. Already tried 1 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=50, sleepTime=1000 MILLISECONDS)
2019-10-08 14:17:45,149 [JobControl] INFO org.apache.hadoop.ipc.Client - Retrying connect to server: cxln2.c.thelab-240901.internal/10.142.1.2:8050. Already tried 2 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=50, sleepTime=1000 MILLISECONDS)
2019-10-08 14:17:46,150 [JobControl] INFO org.apache.hadoop.ipc.Client - Retrying connect to server: cxln2.c.thelab-240901.internal/10.142.1.2:8050. Already tried 3 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=50, sleepTime=1000 MILLISECONDS)
2019-10-08 14:17:47,151 [JobControl] INFO org.apache.hadoop.ipc.Client - Retrying connect to server: cxln2.c.thelab-240901.internal/10.142.1.2:8050. Already tried 4 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=50, sleepTime=1000 MILLISECONDS)
2019-10-08 14:17:48,152 [JobControl] INFO org.apache.hadoop.ipc.Client - Retrying connect to server: cxln2.c.thelab-240901.internal/10.142.1.2:8050. Already tried 5 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=50, sleepTime=1000 MILLISECONDS)
2019-10-08 14:17:49,153 [JobControl] INFO org.apache.hadoop.ipc.Client - Retrying connect to server: cxln2.c.thelab-240901.internal/10.142.1.2:8050. Already tried 6 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=50, sleepTime=1000 MILLISECONDS)
2019-10-08 14:17:50,154 [JobControl] INFO org.apache.hadoop.ipc.Client - Retrying connect to server: cxln2.c.thelab-240901.internal/10.142.1.2:8050. Already tried 7 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=50, sleepTime=1000 MILLISECONDS)
2019-10-08 14:17:51,155 [JobControl] INFO org.apache.hadoop.ipc.Client - Retrying connect to server: cxln2.c.thelab-240901.internal/10.142.1.2:8050. Already tried 8 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=50, sleepTime=1000 MILLISECONDS)
2019-10-08 14:17:52,156 [JobControl] INFO org.apache.hadoop.ipc.Client - Retrying connect to server: cxln2.c.thelab-240901.internal/10.142.1.2:8050. Already tried 9 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=50, sleepTime=1000 MILLISECONDS)
^C2019-10-08 14:17:53,157 [JobControl] INFO org.apache.hadoop.ipc.Client - Retrying connect to server: cxln2.c.thelab-240901.internal/10.142.1.2:8050. Already tried 10 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=50, sleepTime=1000 MILLISECONDS)
2019-10-08 14:17:54,159 [JobControl] INFO org.apache.hadoop.ipc.Client - Retrying connect to server: cxln2.c.thelab-240901.internal/10.142.1.2:8050. Already tried 11 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=50, sleepTime=1000 MILLISECONDS)
2019-10-08 14:17:55,160 [JobControl] INFO org.apache.hadoop.ipc.Client - Retrying connect to server: cxln2.c.thelab-240901.internal/10.142.1.2:8050. Already tried 12 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=50, sleepTime=1000 MILLISECONDS)

How to check Yarn Resource Manager is Up or down? I am in still learning phase

Go to http://a.cloudxlab.com:8080/#/main/services/YARN/summary . You can see that resource manager in red (stopped)

1 Like

Resource Manager is down.
Kindly make it UP.

Thanks & Regards,
Gowtham.

Thanks for quick response. :relaxed:

This has been happening very frequently, Disappointing!!!

Please fix this ASAP,

@sgiri - Hi Sandeep the YARN is down. Can you please restart?

is there any fixed support time ? and not 24*7 ?

Started resource manager.

2 Likes

@sgiri YARN is again down :frowning:

YARN is again down…please help us in this regard…!!!

After upscaling the machine, it took a while to bring up the services.

1 Like