Trace number 283406

Some explanations

A solver is run under the control of another program named runsolver. runsolver is in charge of imposing the CPU time limit and the memory limit to the solver. It also monitors some information about the process. The trace of the execution of a solver is divided into four (or five) parts:
  1. SOLVER DATA
    This is the output of the solver (stdout and stderr).
    Note that some very long lines in this section may be truncated by your web browser ! In such a case, you may want to use the "Download as text" link to get the trace as a text file.

    When the --timestamp option is passed to the runsolver program, each line output by the solver is prepended with a timestamp which indicates at what time the line was output by the solver. Times are relative to the start of the program, given in seconds, and are wall clock time (not CPU time).

    As some 'v lines' may be very long (sometimes several megabytes), the 'v line' output by your solver may be split on several lines to help limit the size of the trace recorded in the database. In any case, the exact output of your solver is preserved in a trace file.
  2. VERIFIER DATA
    The output of the solver is piped to a verifier program which will search a value line "v " and, if found, will check that the given interpretation satisfies all constraints.
  3. CONVERSION SCRIPT DATA (Optionnal)
    When a conversion script is used, this section shows the messages that were output by the conversion script.
  4. WATCHER DATA
    This is the informations gathered by the runsolver program. It first prints the different limits. There's a first limit on CPU time set to X seconds (see the parameters in the trace). After this time has ellapsed, runsolver sends a SIGTERM and 2 seconds later a SIGKILL to the solver. For safety, there's also another limit set to X+30 seconds which will send a SIGXPU to the solver. The last limit is on the virtual memory used by the process (see the parameters in the trace).
    Every ten seconds, the runsolver process fetches the content of /proc/loadavg, /proc/pid/stat and /proc/pid/statm (see man proc) and prints it as raw data. This is only recorded in case we need to investigate the behaviour of a solver. The memory used by the solver (vsize) is also given every ten seconds.
    When the solver exits, runsolver prints some informations such as status and time. CPU usage is the ratio CPU Time/Real Time.
  5. LAUNCHER DATA
    These informations are related to the script which will launch the solver. The most important informations are the command line given to the solver, the md5sum of the different files and the dump of the /proc/cpuinfo and /proc/meminfo which provides some useful information on the computer.

Solver answer on this benchmark

Solver NameAnswerCPU timeWall clock time
rjw-solver 2007-01-21? (MO) 29.09 29.2737

General information on the benchmark

Namehanoi/
hanoi-7_ext.xml
MD5SUMc08c67f756fce77fe7d340c44cf2c4b9
Bench Category2-ARY-EXT (binary constraints in extension)
Best result obtained on this benchmarkSAT
Best CPU time to get the best result obtained on this benchmark1.00685
SatisfiableYES
(Un)Satisfiability was provedYES
Number of variables126
Number of constraints125
Maximum constraint arity2
Maximum domain size2187
Number of constraints which are defined in extension125
Number of constraints which are defined in intension0
Global constraints used (with number of constraints)

Solver Data (download as text)

c 
c 
c 
c conversion script
c 
c 

Verifier Data (download as text)

ERROR: Unexpected answer ! (SAT/UNSAT expected)
Got answer: 

Watcher Data (download as text)

runsolver version 3.1.3 (c) roussel@cril.univ-artois.fr

command line: runsolver --timestamp -w ROOT/results/node70/watcher-283406-1169468666 -o ROOT/results/node70/solver-283406-1169468666 -C 1800 -M 900 /tmp/evaluation/283406-1169468666/solve /tmp/evaluation/283406-1169468666/unknown 1800 

Enforcing CPUTime limit (soft limit, will send SIGTERM then SIGKILL): 1800 seconds
Enforcing CPUTime limit (hard limit, will send SIGXCPU): 1830 seconds
Enforcing VSIZE limit (soft limit, will send SIGTERM then SIGKILL): 921600 KiB
Enforcing VSIZE limit (hard limit, stack expansion will fail with SIGSEGV, brk() and mmap() will return ENOMEM): 972800 KiB
Current StackSize limit: 10240 KiB

/proc/loadavg: 1.93 1.89 1.87 3/75 29903
/proc/meminfo: memFree=1707328/2055920 swapFree=4138760/4192956
[pid=29902] ppid=29900 vsize=5352 CPUtime=0
/proc/29902/stat : 29902 (solve) R 29900 29902 29115 0 -1 4194304 283 174 0 0 0 0 0 0 18 0 1 0 94378526 5480448 229 18446744073709551615 4194304 4889804 548682069328 18446744073709551615 233402067615 0 65538 4100 65536 0 0 0 17 1 0 0
/proc/29902/statm: 1338 229 192 169 0 49 0
[pid=29905] ppid=29902 vsize=5352 CPUtime=0
/proc/29905/stat : 29905 (solve) R 29902 29902 29115 0 -1 4194368 15 0 0 0 0 0 0 0 18 0 1 0 94378526 5480448 229 18446744073709551615 4194304 4889804 548682069328 18446744073709551615 233401913378 0 0 4100 65536 0 0 0 17 1 0 0
/proc/29905/statm: 1338 229 192 169 0 49 0

[startup+0.102059 s]
/proc/loadavg: 1.93 1.89 1.87 3/75 29903
/proc/meminfo: memFree=1707328/2055920 swapFree=4138760/4192956
[pid=29902] ppid=29900 vsize=5352 CPUtime=0
/proc/29902/stat : 29902 (solve) S 29900 29902 29115 0 -1 4194304 315 174 0 0 0 0 0 0 18 0 1 0 94378526 5480448 229 18446744073709551615 4194304 4889804 548682069328 18446744073709551615 233402065732 0 65536 4100 65538 18446744071563356171 0 0 17 1 0 0
/proc/29902/statm: 1338 229 192 169 0 49 0
[pid=29905] ppid=29902 vsize=5720 CPUtime=0.08
/proc/29905/stat : 29905 (xlisp) R 29902 29902 29115 0 -1 4194304 941 0 0 0 8 0 0 0 18 0 1 0 94378526 5857280 900 18446744073709551615 134512640 135322396 4294956848 18446744073709551615 8625243 0 0 4096 130 0 0 0 17 0 0 0
/proc/29905/statm: 1430 900 230 197 0 670 0
Current children cumulated CPU time (s) 0.08
Current children cumulated vsize (KiB) 11072

[startup+0.510102 s]
/proc/loadavg: 1.93 1.89 1.87 3/75 29903
/proc/meminfo: memFree=1707328/2055920 swapFree=4138760/4192956
[pid=29902] ppid=29900 vsize=5352 CPUtime=0
/proc/29902/stat : 29902 (solve) S 29900 29902 29115 0 -1 4194304 315 174 0 0 0 0 0 0 18 0 1 0 94378526 5480448 229 18446744073709551615 4194304 4889804 548682069328 18446744073709551615 233402065732 0 65536 4100 65538 18446744071563356171 0 0 17 1 0 0
/proc/29902/statm: 1338 229 192 169 0 49 0
[pid=29905] ppid=29902 vsize=50144 CPUtime=0.49
/proc/29905/stat : 29905 (xlisp) R 29902 29902 29115 0 -1 4194304 9934 0 0 0 45 4 0 0 21 0 1 0 94378526 51347456 9893 18446744073709551615 134512640 135322396 4294956848 18446744073709551615 8879831 0 0 4096 130 0 0 0 17 0 0 0
/proc/29905/statm: 12536 9893 241 197 0 11776 0
Current children cumulated CPU time (s) 0.49
Current children cumulated vsize (KiB) 55496

[startup+1.33118 s]
/proc/loadavg: 1.93 1.89 1.87 3/77 29906
/proc/meminfo: memFree=1620976/2055920 swapFree=4138760/4192956
[pid=29902] ppid=29900 vsize=5352 CPUtime=0
/proc/29902/stat : 29902 (solve) S 29900 29902 29115 0 -1 4194304 315 174 0 0 0 0 0 0 18 0 1 0 94378526 5480448 229 18446744073709551615 4194304 4889804 548682069328 18446744073709551615 233402065732 0 65536 4100 65538 18446744071563356171 0 0 17 1 0 0
/proc/29902/statm: 1338 229 192 169 0 49 0
[pid=29905] ppid=29902 vsize=107600 CPUtime=1.31
/proc/29905/stat : 29905 (xlisp) R 29902 29902 29115 0 -1 4194304 26400 0 0 0 119 12 0 0 25 0 1 0 94378526 110182400 26359 18446744073709551615 134512640 135322396 4294956848 18446744073709551615 134684815 0 0 4096 130 0 0 0 17 0 0 0
/proc/29905/statm: 26900 26359 241 197 0 26140 0
[pid=29906] ppid=29902 vsize=2600 CPUtime=0
/proc/29906/stat : 29906 (sed) S 29902 29902 29115 0 -1 4194304 148 0 0 0 0 0 0 0 17 0 1 0 94378526 2662400 120 18446744073709551615 4194304 4240196 548682069504 18446744073709551615 233402237010 0 0 4096 0 18446744071563648864 0 0 17 0 0 0
/proc/29906/statm: 650 120 96 11 0 54 0
Current children cumulated CPU time (s) 1.31
Current children cumulated vsize (KiB) 115552

[startup+2.96834 s]
/proc/loadavg: 1.93 1.89 1.87 3/77 29906
/proc/meminfo: memFree=1532400/2055920 swapFree=4138760/4192956
[pid=29902] ppid=29900 vsize=5352 CPUtime=0
/proc/29902/stat : 29902 (solve) S 29900 29902 29115 0 -1 4194304 315 174 0 0 0 0 0 0 18 0 1 0 94378526 5480448 229 18446744073709551615 4194304 4889804 548682069328 18446744073709551615 233402065732 0 65536 4100 65538 18446744071563356171 0 0 17 1 0 0
/proc/29902/statm: 1338 229 192 169 0 49 0
[pid=29905] ppid=29902 vsize=219704 CPUtime=2.94
/proc/29905/stat : 29905 (xlisp) R 29902 29902 29115 0 -1 4194304 54426 0 0 0 269 25 0 0 25 0 1 0 94378526 224976896 54385 18446744073709551615 134512640 135322396 4294956848 18446744073709551615 134672456 0 0 4096 130 0 0 0 17 0 0 0
/proc/29905/statm: 54926 54385 241 197 0 54166 0
[pid=29906] ppid=29902 vsize=2600 CPUtime=0
/proc/29906/stat : 29906 (sed) S 29902 29902 29115 0 -1 4194304 148 0 0 0 0 0 0 0 17 0 1 0 94378526 2662400 120 18446744073709551615 4194304 4240196 548682069504 18446744073709551615 233402237010 0 0 4096 0 18446744071563648864 0 0 17 0 0 0
/proc/29906/statm: 650 120 96 11 0 54 0
Current children cumulated CPU time (s) 2.94
Current children cumulated vsize (KiB) 227656

[startup+6.26768 s]
/proc/loadavg: 1.94 1.89 1.87 3/77 29906
/proc/meminfo: memFree=1377520/2055920 swapFree=4138760/4192956
[pid=29902] ppid=29900 vsize=5352 CPUtime=0
/proc/29902/stat : 29902 (solve) S 29900 29902 29115 0 -1 4194304 315 174 0 0 0 0 0 0 18 0 1 0 94378526 5480448 229 18446744073709551615 4194304 4889804 548682069328 18446744073709551615 233402065732 0 65536 4100 65538 18446744071563356171 0 0 17 1 0 0
/proc/29902/statm: 1338 229 192 169 0 49 0
[pid=29905] ppid=29902 vsize=369176 CPUtime=6.23
/proc/29905/stat : 29905 (xlisp) R 29902 29902 29115 0 -1 4194304 91794 0 0 0 581 42 0 0 25 0 1 0 94378526 378036224 91753 18446744073709551615 134512640 135322396 4294956848 18446744073709551615 134865584 0 0 4096 130 0 0 0 17 0 0 0
/proc/29905/statm: 92294 91753 241 197 0 91534 0
[pid=29906] ppid=29902 vsize=2600 CPUtime=0
/proc/29906/stat : 29906 (sed) S 29902 29902 29115 0 -1 4194304 148 0 0 0 0 0 0 0 17 0 1 0 94378526 2662400 120 18446744073709551615 4194304 4240196 548682069504 18446744073709551615 233402237010 0 0 4096 0 18446744071563648864 0 0 17 0 0 0
/proc/29906/statm: 650 120 96 11 0 54 0
Current children cumulated CPU time (s) 6.23
Current children cumulated vsize (KiB) 377128

[startup+12.7323 s]
/proc/loadavg: 1.94 1.89 1.87 3/77 29906
/proc/meminfo: memFree=1152944/2055920 swapFree=4138760/4192956
[pid=29902] ppid=29900 vsize=5352 CPUtime=0
/proc/29902/stat : 29902 (solve) S 29900 29902 29115 0 -1 4194304 315 174 0 0 0 0 0 0 18 0 1 0 94378526 5480448 229 18446744073709551615 4194304 4889804 548682069328 18446744073709551615 233402065732 0 65536 4100 65538 18446744071563356171 0 0 17 1 0 0
/proc/29902/statm: 1338 229 192 169 0 49 0
[pid=29905] ppid=29902 vsize=574700 CPUtime=12.65
/proc/29905/stat : 29905 (xlisp) R 29902 29902 29115 0 -1 4194304 139492 0 0 0 1201 64 0 0 25 0 1 0 94378526 588492800 139451 18446744073709551615 134512640 135322396 4294956848 18446744073709551615 8879831 0 0 4096 130 0 0 0 17 0 0 0
/proc/29905/statm: 143675 139451 241 197 0 142915 0
[pid=29906] ppid=29902 vsize=2600 CPUtime=0
/proc/29906/stat : 29906 (sed) S 29902 29902 29115 0 -1 4194304 148 0 0 0 0 0 0 0 17 0 1 0 94378526 2662400 120 18446744073709551615 4194304 4240196 548682069504 18446744073709551615 233402237010 0 0 4096 0 18446744071563648864 0 0 17 0 0 0
/proc/29906/statm: 650 120 96 11 0 54 0
Current children cumulated CPU time (s) 12.65
Current children cumulated vsize (KiB) 582652

[startup+25.5706 s]
/proc/loadavg: 1.96 1.89 1.88 3/77 29906
/proc/meminfo: memFree=872240/2055920 swapFree=4138760/4192956
[pid=29902] ppid=29900 vsize=5352 CPUtime=0
/proc/29902/stat : 29902 (solve) S 29900 29902 29115 0 -1 4194304 315 174 0 0 0 0 0 0 18 0 1 0 94378526 5480448 229 18446744073709551615 4194304 4889804 548682069328 18446744073709551615 233402065732 0 65536 4100 65538 18446744071563356171 0 0 17 1 0 0
/proc/29902/statm: 1338 229 192 169 0 49 0
[pid=29905] ppid=29902 vsize=854960 CPUtime=25.42
/proc/29905/stat : 29905 (xlisp) R 29902 29902 29115 0 -1 4194304 213240 0 0 0 2445 97 0 0 25 0 1 0 94378526 875479040 213199 18446744073709551615 134512640 135322396 4294956848 18446744073709551615 134684809 0 0 4096 130 0 0 0 17 0 0 0
/proc/29905/statm: 213740 213199 241 197 0 212980 0
[pid=29906] ppid=29902 vsize=2600 CPUtime=0
/proc/29906/stat : 29906 (sed) S 29902 29902 29115 0 -1 4194304 148 0 0 0 0 0 0 0 17 0 1 0 94378526 2662400 120 18446744073709551615 4194304 4240196 548682069504 18446744073709551615 233402237010 0 0 4096 0 18446744071563648864 0 0 17 0 0 0
/proc/29906/statm: 650 120 96 11 0 54 0
Current children cumulated CPU time (s) 25.42
Current children cumulated vsize (KiB) 862912



Maximum VSize exceeded: sending SIGTERM then SIGKILL

[startup+29.26 s]
/proc/loadavg: 1.96 1.90 1.88 3/77 29906
/proc/meminfo: memFree=778608/2055920 swapFree=4138760/4192956
[pid=29902] ppid=29900 vsize=5352 CPUtime=0
/proc/29902/stat : 29902 (solve) S 29900 29902 29115 0 -1 4194304 315 174 0 0 0 0 0 0 18 0 1 0 94378526 5480448 229 18446744073709551615 4194304 4889804 548682069328 18446744073709551615 233402065732 0 65536 4100 65538 18446744071563356171 0 0 17 1 0 0
/proc/29902/statm: 1338 229 192 169 0 49 0
[pid=29905] ppid=29902 vsize=929696 CPUtime=29.09
/proc/29905/stat : 29905 (xlisp) R 29902 29902 29115 0 -1 4194304 231924 0 0 0 2803 106 0 0 25 0 1 0 94378526 952008704 231883 18446744073709551615 134512640 135322396 4294956848 18446744073709551615 134865587 0 0 4096 130 0 0 0 17 0 0 0
/proc/29905/statm: 232424 231883 241 197 0 231664 0
[pid=29906] ppid=29902 vsize=2600 CPUtime=0
/proc/29906/stat : 29906 (sed) S 29902 29902 29115 0 -1 4194304 148 0 0 0 0 0 0 0 17 0 1 0 94378526 2662400 120 18446744073709551615 4194304 4240196 548682069504 18446744073709551615 233402237010 0 0 4096 0 18446744071563648864 0 0 17 0 0 0
/proc/29906/statm: 650 120 96 11 0 54 0
Current children cumulated CPU time (s) 29.09
Current children cumulated vsize (KiB) 937648

Sending SIGTERM to process tree (bottom up)
Sleeping 2 seconds

Child ended because it received signal 15 (SIGTERM)

!!! problem with CPU time !!!
wait4(...,&childrusage) returns 29902 and gives childrusage.ru_utime.tv_sec=0 childrusage.ru_utime.tv_usec=999 childrusage.ru_stime.tv_sec=0 childrusage.ru_stime.tv_usec=3999
CPU time returned by wait4() is 0.004998
while last known CPU time is 29.09

Solver probably didn't/couldn't wait for its children
Using last known CPU time as value...

Real time (s): 29.2737
CPU time (s): 29.09
CPU user time (s): 28.03
CPU system time (s): 1.06
CPU usage (%): 99.3725
Max. virtual memory (cumulated for all children) (KiB): 937648

getrusage(RUSAGE_CHILDREN,...) data:
user time used= 0.000999
system time used= 0.003999
maximum resident set size= 0
integral shared memory size= 0
integral unshared data size= 0
integral unshared stack size= 0
page reclaims= 637
page faults= 0
swaps= 0
block input operations= 0
block output operations= 0
messages sent= 0
messages received= 0
signals received= 0
voluntary context switches= 21
involuntary context switches= 3

runsolver used 0.021996 s user time and 0.093985 s system time

The end

Launcher Data (download as text)

Begin job on node70 on Mon Jan 22 12:24:27 UTC 2007


IDJOB= 283406
IDBENCH= 4134
IDSOLVER= 93
FILE ID= node70/283406-1169468666

PBS_JOBID= 3613837

Free space on /tmp= 66563 MiB

SOLVER NAME= rjw-solver 2007-01-21
BENCH NAME= HOME/pub/bench/CPAI06/hanoi/hanoi-7_ext.xml
COMMAND LINE= /tmp/evaluation/283406-1169468666/solve /tmp/evaluation/283406-1169468666/unknown 1800
CONVERSION COMMAND LINE= runsolver -w ROOT/results/node70/convwatcher-283406-1169468666 -o ROOT/results/node70/conversion-283406-1169468666 -C 600 -M 900 /tmp/evaluation/283406-1169468666/conversion /tmp/evaluation/283406-1169468666/unknown
CONVERSION RUNSOLVER STATUS CODE= 0
CONVERSION STATUS CODE= 0

RUNSOLVER COMMAND LINE= runsolver  --timestamp  -w ROOT/results/node70/watcher-283406-1169468666 -o ROOT/results/node70/solver-283406-1169468666 -C 1800 -M 900  /tmp/evaluation/283406-1169468666/solve /tmp/evaluation/283406-1169468666/unknown 1800

META MD5SUM SOLVER= 4108942ff566d5523369442a8e3d06c4
MD5SUM BENCH=  c08c67f756fce77fe7d340c44cf2c4b9

RANDOM SEED= 226827324

TIME LIMIT= 1800 seconds

MEMORY LIMIT= 900 MiB


/proc/cpuinfo:
processor	: 0
vendor_id	: GenuineIntel
cpu family	: 15
model		: 4
model name	:                   Intel(R) Xeon(TM) CPU 3.00GHz
stepping	: 3
cpu MHz		: 3000.234
cache size	: 2048 KB
fpu		: yes
fpu_exception	: yes
cpuid level	: 5
wp		: yes
flags		: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm syscall nx lm pni monitor ds_cpl cid cx16 xtpr
bogomips	: 5914.62
clflush size	: 64
cache_alignment	: 128
address sizes	: 36 bits physical, 48 bits virtual
power management:

processor	: 1
vendor_id	: GenuineIntel
cpu family	: 15
model		: 4
model name	:                   Intel(R) Xeon(TM) CPU 3.00GHz
stepping	: 3
cpu MHz		: 3000.234
cache size	: 2048 KB
fpu		: yes
fpu_exception	: yes
cpuid level	: 5
wp		: yes
flags		: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm syscall nx lm pni monitor ds_cpl cid cx16 xtpr
bogomips	: 5586.94
clflush size	: 64
cache_alignment	: 128
address sizes	: 36 bits physical, 48 bits virtual
power management:


/proc/meminfo:
MemTotal:      2055920 kB
MemFree:       1707808 kB
Buffers:          9356 kB
Cached:         243580 kB
SwapCached:      48644 kB
Active:         143636 kB
Inactive:       173000 kB
HighTotal:           0 kB
HighFree:            0 kB
LowTotal:      2055920 kB
LowFree:       1707808 kB
SwapTotal:     4192956 kB
SwapFree:      4138760 kB
Dirty:           11428 kB
Writeback:           0 kB
Mapped:          25060 kB
Slab:            16844 kB
Committed_AS:  1332376 kB
PageTables:       1776 kB
VmallocTotal: 536870911 kB
VmallocUsed:    264952 kB
VmallocChunk: 536605679 kB
HugePages_Total:     0
HugePages_Free:      0
Hugepagesize:     2048 kB

Free space on /tmp at the end= 66552 MiB



End job on node70 on Mon Jan 22 12:27:25 UTC 2007