Yarn Resource Manager down


#1

YARN resource manger is down. Request you to please make it up


#2

Thank you for letting us know. I have turned it on.

Please carry on your work.

In the meantime, I am debugging why has it gone down.


#3

YARN resource manger is again down. Request you to please make it up


#4

Ouch! I have restarted it.


#5

Ok. Thank you for timely help


#6

YARN is down now. Can you please make it up?


#7

same here…yarn is down!!


#8

Unable to work with hive or pig. Please find the below error

2019-10-08 14:17:42,094 [JobControl] INFO org.apache.hadoop.yarn.client.AHSProxy - Connecting to Application History server at cxln2.c.thelab-240901.internal/10.142.1.2:10200
2019-10-08 14:17:43,146 [JobControl] INFO org.apache.hadoop.ipc.Client - Retrying connect to server: cxln2.c.thelab-240901.internal/10.142.1.2:8050. Already tried 0 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=50, sleepTime=1000 MILLISECONDS)
2019-10-08 14:17:44,148 [JobControl] INFO org.apache.hadoop.ipc.Client - Retrying connect to server: cxln2.c.thelab-240901.internal/10.142.1.2:8050. Already tried 1 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=50, sleepTime=1000 MILLISECONDS)
2019-10-08 14:17:45,149 [JobControl] INFO org.apache.hadoop.ipc.Client - Retrying connect to server: cxln2.c.thelab-240901.internal/10.142.1.2:8050. Already tried 2 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=50, sleepTime=1000 MILLISECONDS)
2019-10-08 14:17:46,150 [JobControl] INFO org.apache.hadoop.ipc.Client - Retrying connect to server: cxln2.c.thelab-240901.internal/10.142.1.2:8050. Already tried 3 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=50, sleepTime=1000 MILLISECONDS)
2019-10-08 14:17:47,151 [JobControl] INFO org.apache.hadoop.ipc.Client - Retrying connect to server: cxln2.c.thelab-240901.internal/10.142.1.2:8050. Already tried 4 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=50, sleepTime=1000 MILLISECONDS)
2019-10-08 14:17:48,152 [JobControl] INFO org.apache.hadoop.ipc.Client - Retrying connect to server: cxln2.c.thelab-240901.internal/10.142.1.2:8050. Already tried 5 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=50, sleepTime=1000 MILLISECONDS)
2019-10-08 14:17:49,153 [JobControl] INFO org.apache.hadoop.ipc.Client - Retrying connect to server: cxln2.c.thelab-240901.internal/10.142.1.2:8050. Already tried 6 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=50, sleepTime=1000 MILLISECONDS)
2019-10-08 14:17:50,154 [JobControl] INFO org.apache.hadoop.ipc.Client - Retrying connect to server: cxln2.c.thelab-240901.internal/10.142.1.2:8050. Already tried 7 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=50, sleepTime=1000 MILLISECONDS)
2019-10-08 14:17:51,155 [JobControl] INFO org.apache.hadoop.ipc.Client - Retrying connect to server: cxln2.c.thelab-240901.internal/10.142.1.2:8050. Already tried 8 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=50, sleepTime=1000 MILLISECONDS)
2019-10-08 14:17:52,156 [JobControl] INFO org.apache.hadoop.ipc.Client - Retrying connect to server: cxln2.c.thelab-240901.internal/10.142.1.2:8050. Already tried 9 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=50, sleepTime=1000 MILLISECONDS)
^C2019-10-08 14:17:53,157 [JobControl] INFO org.apache.hadoop.ipc.Client - Retrying connect to server: cxln2.c.thelab-240901.internal/10.142.1.2:8050. Already tried 10 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=50, sleepTime=1000 MILLISECONDS)
2019-10-08 14:17:54,159 [JobControl] INFO org.apache.hadoop.ipc.Client - Retrying connect to server: cxln2.c.thelab-240901.internal/10.142.1.2:8050. Already tried 11 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=50, sleepTime=1000 MILLISECONDS)
2019-10-08 14:17:55,160 [JobControl] INFO org.apache.hadoop.ipc.Client - Retrying connect to server: cxln2.c.thelab-240901.internal/10.142.1.2:8050. Already tried 12 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=50, sleepTime=1000 MILLISECONDS)


#9

How to check Yarn Resource Manager is Up or down? I am in still learning phase


#10

Go to http://a.cloudxlab.com:8080/#/main/services/YARN/summary . You can see that resource manager in red (stopped)


#11

Resource Manager is down.
Kindly make it UP.

Thanks & Regards,
Gowtham.


#12

Thanks for quick response. :relaxed:


#13

This has been happening very frequently, Disappointing!!!

Please fix this ASAP,


#14

@sgiri - Hi Sandeep the YARN is down. Can you please restart?


#15

is there any fixed support time ? and not 24*7 ?


#16

Started resource manager.


#17

@sgiri YARN is again down :frowning:


#18

YARN is again down…please help us in this regard…!!!


#19


#20

After upscaling the machine, it took a while to bring up the services.