This page (revision-9) was last changed on 23-Apr-2022 17:06 by Harry Metske

This page was created on 23-Apr-2022 17:05 by unknown

Only authorized users are allowed to rename pages.

Only authorized users are allowed to delete pages.

Page revision history

Version Date Modified Size Author Changes ... Change note
9 23-Apr-2022 17:06 8 KB Harry Metske to previous
8 23-Apr-2022 17:05 8 KB Harry Metske to previous | to last
7 23-Apr-2022 17:05 6 KB Harry Metske to previous | to last
6 23-Apr-2022 17:05 5 KB Harry Metske to previous | to last
5 23-Apr-2022 17:05 5 KB Harry Metske to previous | to last
4 23-Apr-2022 17:05 4 KB Harry Metske to previous | to last
3 23-Apr-2022 17:05 2 KB Harry Metske to previous | to last
2 23-Apr-2022 17:05 1 KB Harry Metske to previous | to last
1 23-Apr-2022 17:05 1 KB unknown to last

Page References

Incoming links Outgoing links
Nomad...nobody

Version management

Difference between version and

At line 108 removed 95 lines
I fired up a RHEL7 ami (instead of the Amazon Linux AMI) to see if that helps, but that also gives the same error.
Assuming it has something to do with the driver, I changed the driver from "exec" to "raw_exec", but that gives a "missing drivers" :
{{{
[ec2-user@ip-172-31-22-13 jobs]$ nomad run SimpleHTTPServer.nomad
==> Monitoring evaluation "306cb11a-b012-c89f-aa36-69c9a0e44464"
Evaluation triggered by job "SimpleHTTPServer"
Scheduling error for group "cache" (failed to find a node for placement)
Allocation "7a0350ea-8396-96f7-308d-1aee4a500619" status "failed" (1/1 nodes filtered)
* Constraint "missing drivers" filtered 1 nodes
Evaluation status changed: "pending" -> "complete"
==> Evaluation "306cb11a-b012-c89f-aa36-69c9a0e44464" finished with status "complete"
}}}
Checking out the logs of the nomad client confirms:
{{{
2016/01/12 07:59:19 [DEBUG] client: available drivers [exec java]
}}}
! Looping PID 1 ?!
After running a nomad job like this :
{{{
driver = "exec"
config {
command = "/bin/bash"
args = [
"-c",
"mkdir ff && cd ff && curl --silent --show-error --remote-name 'http://www.computerhok.nl/tmp/dropwizardtest-1.2-assembly.zip' && unzip *.zip && cd * && java -jar dropwizardtest*.jar server helloworld.yaml"]
}
}}}
This results in a hardly responding host, and just before reboot:
{{{
[root@ip-172-31-19-2 log]#
Message from syslogd@ip-172-31-19-2 at Jan 17 09:25:51 ...
kernel:BUG: soft lockup - CPU#0 stuck for 23s! [nomad:9073]
}}}
!! chroot for exec task ?
According to the documentation, an "exec" task is running chrooted. Although we do like chrooted envs, I think looks like this requires quite some diskspace (730 MB for one python cmd) (not sure if that's true with all those mounts here):
{{{
[root@ip-172-31-29-208 alloc]# pwd
/home/ec2-user/nomad/data/alloc
[root@ip-172-31-29-208 alloc]# ls -l
total 8
drwx------ 3 root root 4096 Jan 13 07:05 651023c2-5bcc-9997-8350-c179eb38e73d
drwx------ 4 root root 4096 Jan 13 07:06 c6c34727-2e79-7112-674f-c02fba46209a
[root@ip-172-31-29-208 alloc]# du -cms . 2>/dev/null
731 .
731 total
[root@ip-172-31-29-208 alloc]# ls -l c6c34727-2e79-7112-674f-c02fba46209a/web
total 36
drwxrwxrwx 5 nobody nobody 4096 Jan 13 07:06 alloc
dr-xr-xr-x 2 root root 4096 Jan 13 07:06 bin
drwxr-xr-x 16 root root 2720 Jan 13 06:43 dev
drwxr-xr-x 75 root root 4096 Jan 13 07:06 etc
dr-xr-xr-x 7 root root 4096 Jan 13 07:06 lib
dr-xr-xr-x 10 root root 12288 Jan 13 07:06 lib64
drwxrwxrwx 2 nobody nobody 4096 Jan 13 07:06 local
dr-xr-xr-x 86 root root 0 Jan 13 06:43 proc
dr-xr-xr-x 5 root root 4096 Jan 13 07:06 usr
[root@ip-172-31-29-208 proc]# mount
mount: /proc/self/mountinfo: parse error: ignore entry at line 9.
mount: /proc/self/mountinfo: parse error: ignore entry at line 10.
mount: /proc/self/mountinfo: parse error: ignore entry at line 12.
mount: /proc/self/mountinfo: parse error: ignore entry at line 13.
mount: /proc/self/mountinfo: parse error: ignore entry at line 15.
mount: /proc/self/mountinfo: parse error: ignore entry at line 16.
proc on /proc type proc (rw,relatime)
sysfs on /sys type sysfs (rw,relatime)
/dev/xvda1 on / type ext4 (rw,noatime,data=ordered)
devtmpfs on /dev type devtmpfs (rw,relatime,size=500712k,nr_inodes=125178,mode=755)
devpts on /dev/pts type devpts (rw,relatime,gid=5,mode=620,ptmxmode=000)
tmpfs on /dev/shm type tmpfs (rw,relatime)
none on /proc/sys/fs/binfmt_misc type binfmt_misc (rw,relatime)
/dev/xvda1 on /home/ec2-user/nomad/data/alloc/651023c2-5bcc-9997-8350-c179eb38e73d/web/alloc type ext4 (rw,noatime,data=ordered)
none on /home/ec2-user/nomad/data/alloc/651023c2-5bcc-9997-8350-c179eb38e73d/web/dev
none on /home/ec2-user/nomad/data/alloc/651023c2-5bcc-9997-8350-c179eb38e73d/web/proc
/dev/xvda1 on /home/ec2-user/nomad/data/alloc/c6c34727-2e79-7112-674f-c02fba46209a/web/alloc type ext4 (rw,noatime,data=ordered)
none on /home/ec2-user/nomad/data/alloc/c6c34727-2e79-7112-674f-c02fba46209a/web/dev
none on /home/ec2-user/nomad/data/alloc/c6c34727-2e79-7112-674f-c02fba46209a/web/proc
/dev/xvda1 on /home/ec2-user/nomad/data/alloc/a3cc556f-eafe-4c30-a479-0fd63d8d63fd/fulltest-task/alloc type ext4 (rw,noatime,data=ordered)
none on /home/ec2-user/nomad/data/alloc/a3cc556f-eafe-4c30-a479-0fd63d8d63fd/fulltest-task/dev
none on /home/ec2-user/nomad/data/alloc/a3cc556f-eafe-4c30-a479-0fd63d8d63fd/fulltest-task/proc
}}}