Trace number 280571

Some explanations

A solver is run under the control of another program named runsolver. runsolver is in charge of imposing the CPU time limit and the memory limit on the solver. It also monitors some information about the process. The trace of the execution of a solver is divided into four (or five) parts:
  1. SOLVER DATA
    This is the output of the solver (stdout and stderr).
    Note that some very long lines in this section may be truncated by your web browser! In such a case, you may want to use the "Download as text" link to get the trace as a text file.

    When the --timestamp option is passed to the runsolver program, each line output by the solver is prepended with a timestamp which indicates at what time the line was output by the solver. Times are relative to the start of the program, given in seconds, and are wall clock time (not CPU time).

    As some 'v lines' may be very long (sometimes several megabytes), the 'v line' output by your solver may be split across several lines to help limit the size of the trace recorded in the database. In any case, the exact output of your solver is preserved in a trace file.
  2. VERIFIER DATA
    The output of the solver is piped to a verifier program which will search for a value line ("v ") and, if found, will check that the given interpretation satisfies all constraints.
  3. CONVERSION SCRIPT DATA (optional)
    When a conversion script is used, this section shows the messages that were output by the conversion script.
  4. WATCHER DATA
    This is the information gathered by the runsolver program. It first prints the different limits. There's a first limit on CPU time set to X seconds (see the parameters in the trace). After this time has elapsed, runsolver sends a SIGTERM and, 2 seconds later, a SIGKILL to the solver. For safety, there's also another limit set to X+30 seconds which will send a SIGXCPU to the solver. The last limit is on the virtual memory used by the process (see the parameters in the trace).
    Every ten seconds, the runsolver process fetches the content of /proc/loadavg, /proc/pid/stat and /proc/pid/statm (see man proc) and prints it as raw data. This is only recorded in case we need to investigate the behaviour of a solver. The memory used by the solver (vsize) is also given every ten seconds.
    When the solver exits, runsolver prints some information such as status and time. CPU usage is the ratio CPU Time/Real Time.
  5. LAUNCHER DATA
    This information relates to the script which launches the solver. The most important items are the command line given to the solver, the md5sum of the different files, and the dump of /proc/cpuinfo and /proc/meminfo, which provides some useful information on the computer.

Solver answer on this benchmark

Solver Name             Answer  CPU time  Wall clock time
Toolbar_BTD 2007-01-12  ? (MO)  27        27.2139

General information on the benchmark

Name: MaxCSP/pedigree/sheep4r_ext.xml
MD5SUM: 2bde362b821596dc76cbeda87f41a444
Bench Category: N-ARY-EXT (n-ary constraints in extension)
Best result obtained on this benchmark: MOPT
Best Number of satisfied constraints: 11105
Best CPU time to get the best result obtained on this benchmark: 679.678
Satisfiable:
(Un)Satisfiability was proved:
Number of variables: 8921
Number of constraints: 11107
Maximum constraint arity: 3
Maximum domain size: 10
Number of constraints which are defined in extension: 11107
Number of constraints which are defined in intension: 0
Global constraints used (with number of constraints):

Solver Data (download as text)

c 
c 
c 
c conversion script
c 
c 

translate /tmp/evaluation/280571-1169306738/unknown.xml to /tmp/evaluation/280571-1169306738/unknown.wcsp

Verifier Data (download as text)

ERROR: no interpretation found !
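
The search step of the verifier can be illustrated with a small sketch. This is not the verifier's actual code: the function name `find_value_tokens` is hypothetical, and it only assumes, as the explanations state, that a value line starts with "v " and that a long value line may be split over several lines. Checking the interpretation against the constraints is not covered.

```python
from typing import List, Optional


def find_value_tokens(solver_output: str) -> Optional[List[str]]:
    """Collect the tokens of every line starting with 'v ' (hypothetical helper).

    Returns None when no value line exists, which corresponds to the
    "no interpretation found" case reported above.
    """
    tokens: List[str] = []
    found = False
    for line in solver_output.splitlines():
        if line.startswith("v "):
            found = True
            # A long value line may be split over several 'v ' lines,
            # so the tokens are accumulated across all of them.
            tokens.extend(line[2:].split())
    return tokens if found else None


# Solver output of this trace: only comment lines and a conversion message.
this_trace = "c \nc conversion script\nc \n\ntranslate unknown.xml to unknown.wcsp\n"
print(find_value_tokens(this_trace))

# Made-up output containing a value line, for comparison.
print(find_value_tokens("c searching\nv 1 0 2\n"))
```

Here the solver was killed before printing any value line, so the search finds nothing.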

Watcher Data (download as text)

runsolver version 3.1.3 (c) roussel@cril.univ-artois.fr

command line: runsolver --timestamp -w ROOT/results/node35/watcher-280571-1169306738 -o ROOT/results/node35/solver-280571-1169306738 -C 2400 -M 900 /tmp/evaluation/280571-1169306738/solver.sh /tmp/evaluation/280571-1169306738/unknown 

Enforcing CPUTime limit (soft limit, will send SIGTERM then SIGKILL): 2400 seconds
Enforcing CPUTime limit (hard limit, will send SIGXCPU): 2430 seconds
Enforcing VSIZE limit (soft limit, will send SIGTERM then SIGKILL): 921600 KiB
Enforcing VSIZE limit (hard limit, stack expansion will fail with SIGSEGV, brk() and mmap() will return ENOMEM): 972800 KiB
Current StackSize limit: 10240 KiB
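
As a rough cross-check of the four limits printed above, the sketch below recomputes them from the -C and -M arguments. The 30-second margin on the CPU hard limit comes from the explanations at the top of this page; the 50 MiB margin on the VSIZE hard limit is only inferred from the numbers in this trace and should be read as an assumption, not as documented runsolver behaviour. The function name is hypothetical.

```python
def runsolver_limits(cpu_limit_s: int, mem_limit_mib: int) -> dict:
    """Recompute the limits shown in the watcher data (illustrative only)."""
    return {
        "cpu_soft_s": cpu_limit_s,
        # Stated in the explanations: a second limit at X+30 seconds.
        "cpu_hard_s": cpu_limit_s + 30,
        # -M is given in MiB; the trace prints KiB.
        "vsize_soft_kib": mem_limit_mib * 1024,
        # Assumption inferred from this trace: hard limit = soft + 50 MiB.
        "vsize_hard_kib": (mem_limit_mib + 50) * 1024,
    }


limits = runsolver_limits(cpu_limit_s=2400, mem_limit_mib=900)
for name, value in limits.items():
    print(name, value)
```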

/proc/loadavg: 1.93 1.40 1.15 5/73 32357
/proc/meminfo: memFree=1046488/2055920 swapFree=4184640/4192956
[pid=32356] ppid=32354 vsize=6392 CPUtime=0
/proc/32356/stat : 32356 (solver.sh) R 32354 32356 31863 0 -1 4194304 236 0 0 0 0 0 0 0 20 0 1 0 279068881 6545408 198 18446744073709551615 4194304 4520092 548682069344 18446744073709551615 266131764940 0 0 4096 8192 0 0 0 17 1 0 0
/proc/32356/statm: 1598 198 148 79 0 122 0

[startup+0.105216 s]
/proc/loadavg: 1.93 1.40 1.15 5/73 32357
/proc/meminfo: memFree=1046488/2055920 swapFree=4184640/4192956
[pid=32356] ppid=32354 vsize=47532 CPUtime=0.01
/proc/32356/stat : 32356 (solver.sh) S 32354 32356 31863 0 -1 4194304 847 2113 0 0 0 0 0 1 20 0 1 0 279068881 48672768 360 18446744073709551615 4194304 4520092 548682069344 18446744073709551615 266134023002 0 2 4096 73728 18446744071563181037 0 0 17 1 0 0
/proc/32356/statm: 11883 360 245 79 0 217 0
Current children cumulated CPU time (s) 0.01
Current children cumulated vsize (KiB) 47532

[startup+0.515427 s]
/proc/loadavg: 1.93 1.40 1.15 5/73 32357
/proc/meminfo: memFree=1046488/2055920 swapFree=4184640/4192956
[pid=32356] ppid=32354 vsize=47532 CPUtime=0.01
/proc/32356/stat : 32356 (solver.sh) S 32354 32356 31863 0 -1 4194304 847 2113 0 0 0 0 0 1 20 0 1 0 279068881 48672768 360 18446744073709551615 4194304 4520092 548682069344 18446744073709551615 266134023002 0 2 4096 73728 18446744071563181037 0 0 17 1 0 0
/proc/32356/statm: 11883 360 245 79 0 217 0
Current children cumulated CPU time (s) 0.01
Current children cumulated vsize (KiB) 47532

[startup+1.33636 s]
/proc/loadavg: 1.93 1.40 1.15 3/74 32373
/proc/meminfo: memFree=1019600/2055920 swapFree=4184640/4192956
[pid=32356] ppid=32354 vsize=47532 CPUtime=0.01
/proc/32356/stat : 32356 (solver.sh) S 32354 32356 31863 0 -1 4194304 847 2113 0 0 0 0 0 1 20 0 1 0 279068881 48672768 360 18446744073709551615 4194304 4520092 548682069344 18446744073709551615 266134023002 0 2 4096 73728 18446744071563181037 0 0 17 1 0 0
/proc/32356/statm: 11883 360 245 79 0 217 0
[pid=32373] ppid=32356 vsize=330964 CPUtime=1.28
/proc/32373/stat : 32373 (toolbarBtd) R 32356 32356 31863 0 -1 0 9186 0 0 0 123 5 0 0 25 0 1 0 279068885 338907136 7180 18446744073709551615 134512640 135169536 4294956528 18446744073709551615 134950493 0 0 4096 0 0 0 0 17 1 0 0
/proc/32373/statm: 82741 7180 46 160 0 82577 0
Current children cumulated CPU time (s) 1.29
Current children cumulated vsize (KiB) 378496
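
The raw /proc lines can be decoded by hand. The sketch below parses the stat and statm lines of pid 32373 from the sample just above, using the field numbering given in man proc (utime is field 14, stime field 15, vsize field 23). The 4 KiB page size and the 100 clock ticks per second are assumptions; they happen to reproduce the vsize and CPUtime values printed by runsolver. The helper name `parse_stat` is hypothetical.

```python
PAGE_KIB = 4            # assumed page size in KiB
TICKS_PER_SECOND = 100  # assumed clock tick rate

STAT = ("32373 (toolbarBtd) R 32356 32356 31863 0 -1 0 9186 0 0 0 123 5 0 0 "
        "25 0 1 0 279068885 338907136 7180 18446744073709551615 134512640 "
        "135169536 4294956528 18446744073709551615 134950493 0 0 4096 0 0 0 0 "
        "17 1 0 0")
STATM = "82741 7180 46 160 0 82577 0"


def parse_stat(line: str) -> dict:
    """Extract a few fields from a /proc/pid/stat line."""
    # The command name is enclosed in parentheses and may contain spaces,
    # so split around the last ')' instead of splitting on whitespace only.
    head, _, tail = line.rpartition(")")
    pid_text, _, comm = head.partition("(")
    fields = tail.split()  # fields[0] is field 3 (state) in man proc numbering
    utime_ticks = int(fields[14 - 3])
    stime_ticks = int(fields[15 - 3])
    vsize_bytes = int(fields[23 - 3])
    return {
        "pid": int(pid_text),
        "comm": comm,
        "ppid": int(fields[4 - 3]),
        "cpu_time_s": (utime_ticks + stime_ticks) / TICKS_PER_SECOND,
        "vsize_kib": vsize_bytes // 1024,
    }


info = parse_stat(STAT)
statm_size_kib = int(STATM.split()[0]) * PAGE_KIB  # first statm field: size in pages
print(info)
print(statm_size_kib)
```

Both routes give the same virtual size as the `vsize=330964` line, and adding the 47532 KiB of solver.sh gives the cumulated 378496 KiB.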

[startup+2.97365 s]
/proc/loadavg: 1.93 1.40 1.15 3/74 32373
/proc/meminfo: memFree=998032/2055920 swapFree=4184640/4192956
[pid=32356] ppid=32354 vsize=47532 CPUtime=0.01
/proc/32356/stat : 32356 (solver.sh) S 32354 32356 31863 0 -1 4194304 847 2113 0 0 0 0 0 1 20 0 1 0 279068881 48672768 360 18446744073709551615 4194304 4520092 548682069344 18446744073709551615 266134023002 0 2 4096 73728 18446744071563181037 0 0 17 1 0 0
/proc/32356/statm: 11883 360 245 79 0 217 0
[pid=32373] ppid=32356 vsize=343636 CPUtime=2.89
/proc/32373/stat : 32373 (toolbarBtd) R 32356 32356 31863 0 -1 0 15141 0 0 0 282 7 0 0 25 0 1 0 279068885 351883264 11747 18446744073709551615 134512640 135169536 4294956528 18446744073709551615 134517659 0 0 4096 0 0 0 0 17 1 0 0
/proc/32373/statm: 85909 11747 46 160 0 85745 0
Current children cumulated CPU time (s) 2.9
Current children cumulated vsize (KiB) 391168

[startup+6.26396 s]
/proc/loadavg: 1.93 1.41 1.15 3/74 32373
/proc/meminfo: memFree=996880/2055920 swapFree=4184640/4192956
[pid=32356] ppid=32354 vsize=47532 CPUtime=0.01
/proc/32356/stat : 32356 (solver.sh) S 32354 32356 31863 0 -1 4194304 847 2113 0 0 0 0 0 1 20 0 1 0 279068881 48672768 360 18446744073709551615 4194304 4520092 548682069344 18446744073709551615 266134023002 0 2 4096 73728 18446744071563181037 0 0 17 1 0 0
/proc/32356/statm: 11883 360 245 79 0 217 0
[pid=32373] ppid=32356 vsize=402708 CPUtime=6.16
/proc/32373/stat : 32373 (toolbarBtd) R 32356 32356 31863 0 -1 0 71704 0 0 0 597 19 0 0 25 0 1 0 279068885 412372992 24549 18446744073709551615 134512640 135169536 4294956528 18446744073709551615 134818001 0 0 4096 0 0 0 0 17 1 0 0
/proc/32373/statm: 100677 24549 48 160 0 100513 0
Current children cumulated CPU time (s) 6.17
Current children cumulated vsize (KiB) 450240

[startup+12.7247 s]
/proc/loadavg: 1.94 1.42 1.15 3/74 32373
/proc/meminfo: memFree=636112/2055920 swapFree=4184640/4192956
[pid=32356] ppid=32354 vsize=47532 CPUtime=0.01
/proc/32356/stat : 32356 (solver.sh) S 32354 32356 31863 0 -1 4194304 847 2113 0 0 0 0 0 1 20 0 1 0 279068881 48672768 360 18446744073709551615 4194304 4520092 548682069344 18446744073709551615 266134023002 0 2 4096 73728 18446744071563181037 0 0 17 1 0 0
/proc/32356/statm: 11883 360 245 79 0 217 0
[pid=32373] ppid=32356 vsize=661444 CPUtime=12.56
/proc/32373/stat : 32373 (toolbarBtd) R 32356 32356 31863 0 -1 0 168511 0 0 0 1200 56 0 0 25 0 1 0 279068885 677318656 89566 18446744073709551615 134512640 135169536 4294956528 18446744073709551615 134626959 0 0 4096 0 0 0 0 17 1 0 0
/proc/32373/statm: 165361 89566 51 160 0 165197 0
Current children cumulated CPU time (s) 12.57
Current children cumulated vsize (KiB) 708976

[startup+25.5603 s]
/proc/loadavg: 1.87 1.43 1.16 3/81 32506
/proc/meminfo: memFree=992080/2055920 swapFree=4184640/4192956
[pid=32356] ppid=32354 vsize=47532 CPUtime=0.01
/proc/32356/stat : 32356 (solver.sh) S 32354 32356 31863 0 -1 4194304 847 2113 0 0 0 0 0 1 20 0 1 0 279068881 48672768 360 18446744073709551615 4194304 4520092 548682069344 18446744073709551615 266134023002 0 2 4096 73728 18446744071563181037 0 0 17 1 0 0
/proc/32356/statm: 11883 360 245 79 0 217 0
[pid=32373] ppid=32356 vsize=661840 CPUtime=25.33
/proc/32373/stat : 32373 (toolbarBtd) R 32356 32356 31863 0 -1 0 168614 0 0 0 2475 58 0 0 25 0 1 0 279068885 677724160 89669 18446744073709551615 134512640 135169536 4294956528 18446744073709551615 134624936 0 0 4096 0 0 0 0 17 0 0 0
/proc/32373/statm: 165460 89669 51 160 0 165296 0
Current children cumulated CPU time (s) 25.34
Current children cumulated vsize (KiB) 709372



Maximum VSize exceeded: sending SIGTERM then SIGKILL

[startup+27.2025 s]
/proc/loadavg: 1.87 1.43 1.16 3/81 32506
/proc/meminfo: memFree=774608/2055920 swapFree=4184640/4192956
[pid=32356] ppid=32354 vsize=47532 CPUtime=0.01
/proc/32356/stat : 32356 (solver.sh) S 32354 32356 31863 0 -1 4194304 847 2113 0 0 0 0 0 1 20 0 1 0 279068881 48672768 360 18446744073709551615 4194304 4520092 548682069344 18446744073709551615 266134023002 0 2 4096 73728 18446744071563181037 0 0 17 1 0 0
/proc/32356/statm: 11883 360 245 79 0 217 0
[pid=32373] ppid=32356 vsize=905308 CPUtime=26.99
/proc/32373/stat : 32373 (toolbarBtd) R 32356 32356 31863 0 -1 0 222585 0 0 0 2616 83 0 0 25 0 1 0 279068885 927035392 143640 18446744073709551615 134512640 135169536 4294956528 18446744073709551615 134818001 0 0 4096 0 0 0 0 17 0 0 0
/proc/32373/statm: 226327 143648 51 160 0 226163 0
Current children cumulated CPU time (s) 27
Current children cumulated vsize (KiB) 952840
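
The reason for the "Maximum VSize exceeded" message can be reproduced from this last sample. The sketch below sums the per-process vsize values and compares the total with the soft limit printed at the top of the watcher data. It assumes the check is made on the cumulated value, which is what the numbers in this trace suggest; the variable names are illustrative.

```python
# vsize in KiB per process, copied from the sample at startup+27.2025 s
vsize_kib = {
    32356: 47532,    # solver.sh
    32373: 905308,   # toolbarBtd
}

SOFT_LIMIT_KIB = 921600  # "Enforcing VSIZE limit (soft limit ...)" line

cumulated_kib = sum(vsize_kib.values())
exceeded = cumulated_kib > SOFT_LIMIT_KIB

print(cumulated_kib, exceeded)
```

Note that toolbarBtd alone (905308 KiB) is still below the soft limit; it is the sum over the process tree that crosses it.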

Sending SIGTERM to process tree (bottom up)
Sleeping 2 seconds

Child ended because it received signal 15 (SIGTERM)

!!! problem with CPU time !!!
wait4(...,&childrusage) returns 32356 and gives childrusage.ru_utime.tv_sec=0 childrusage.ru_utime.tv_usec=10998 childrusage.ru_stime.tv_sec=0 childrusage.ru_stime.tv_usec=20996
CPU time returned by wait4() is 0.031994
while last known CPU time is 27

Solver probably didn't/couldn't wait for its children
Using last known CPU time as value...
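
The fallback described in the messages above can be sketched as follows. The exact condition runsolver applies is not shown in the trace, so the comparison used here (prefer the last sampled value when wait4() reports less) is an assumption, and the function name is hypothetical.

```python
def reported_cpu_time(wait4_user_s: float, wait4_system_s: float,
                      last_known_s: float) -> float:
    """Pick the CPU time to report (illustrative reconstruction)."""
    wait4_total = wait4_user_s + wait4_system_s
    # If the parent did not wait for its children, wait4() only accounts
    # for the parent itself and misses the children's CPU time.
    if wait4_total < last_known_s:
        return last_known_s
    return wait4_total


# Values from this trace: ru_utime = 0.010998 s, ru_stime = 0.020996 s,
# last CPU time sampled from /proc = 27 s.
cpu_time = reported_cpu_time(0.010998, 0.020996, 27)
print(cpu_time)
```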

Real time (s): 27.2139
CPU time (s): 27
CPU user time (s): 26.16
CPU system time (s): 0.84
CPU usage (%): 99.2141
Max. virtual memory (cumulated for all children) (KiB): 952840
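
The CPU usage figure follows directly from the ratio CPU Time/Real Time given in the explanations. A minimal check with the rounded values printed above (the function name is illustrative):

```python
def cpu_usage_percent(cpu_time_s: float, real_time_s: float) -> float:
    """CPU usage as a percentage: CPU time divided by wall clock time."""
    return 100.0 * cpu_time_s / real_time_s


usage = cpu_usage_percent(cpu_time_s=27, real_time_s=27.2139)
print(f"{usage:.4f}")

# The CPU time itself is the sum of user and system time.
total_cpu = 26.16 + 0.84
print(f"{total_cpu:.2f}")
```

The result may differ from the printed 99.2141 in the last digit, presumably because runsolver computes the ratio from unrounded times.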

getrusage(RUSAGE_CHILDREN,...) data:
user time used= 0.010998
system time used= 0.020996
maximum resident set size= 0
integral shared memory size= 0
integral unshared data size= 0
integral unshared stack size= 0
page reclaims= 2960
page faults= 0
swaps= 0
block input operations= 0
block output operations= 0
messages sent= 0
messages received= 0
signals received= 0
voluntary context switches= 60
involuntary context switches= 26

runsolver used 0.042993 s user time and 0.083987 s system time

The end

Launcher Data (download as text)

Begin job on node35 on Sat Jan 20 15:25:39 UTC 2007


IDJOB= 280571
IDBENCH= 11874
IDSOLVER= 72
FILE ID= node35/280571-1169306738

PBS_JOBID= 3610116

Free space on /tmp= 66542 MiB

SOLVER NAME= Toolbar_BTD 2007-01-12
BENCH NAME= HOME/pub/bench/CPAI06/MaxCSP/pedigree/sheep4r_ext.xml
COMMAND LINE= /tmp/evaluation/280571-1169306738/solver.sh /tmp/evaluation/280571-1169306738/unknown
CONVERSION COMMAND LINE= runsolver -w ROOT/results/node35/convwatcher-280571-1169306738 -o ROOT/results/node35/conversion-280571-1169306738 -C 600 -M 900 /tmp/evaluation/280571-1169306738/translate /tmp/evaluation/280571-1169306738/unknown
CONVERSION RUNSOLVER STATUS CODE= 0
CONVERSION STATUS CODE= 0

RUNSOLVER COMMAND LINE= runsolver  --timestamp  -w ROOT/results/node35/watcher-280571-1169306738 -o ROOT/results/node35/solver-280571-1169306738 -C 2400 -M 900  /tmp/evaluation/280571-1169306738/solver.sh /tmp/evaluation/280571-1169306738/unknown

META MD5SUM SOLVER= f75ee5e830002fa98429bd59d9dcbc78
MD5SUM BENCH=  2bde362b821596dc76cbeda87f41a444

RANDOM SEED= 532099382

TIME LIMIT= 2400 seconds

MEMORY LIMIT= 900 MiB


/proc/cpuinfo:
processor	: 0
vendor_id	: GenuineIntel
cpu family	: 15
model		: 4
model name	:                   Intel(R) Xeon(TM) CPU 3.00GHz
stepping	: 3
cpu MHz		: 3000.234
cache size	: 2048 KB
fpu		: yes
fpu_exception	: yes
cpuid level	: 5
wp		: yes
flags		: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm syscall nx lm pni monitor ds_cpl cid cx16 xtpr
bogomips	: 5914.62
clflush size	: 64
cache_alignment	: 128
address sizes	: 36 bits physical, 48 bits virtual
power management:

processor	: 1
vendor_id	: GenuineIntel
cpu family	: 15
model		: 4
model name	:                   Intel(R) Xeon(TM) CPU 3.00GHz
stepping	: 3
cpu MHz		: 3000.234
cache size	: 2048 KB
fpu		: yes
fpu_exception	: yes
cpuid level	: 5
wp		: yes
flags		: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm syscall nx lm pni monitor ds_cpl cid cx16 xtpr
bogomips	: 5586.94
clflush size	: 64
cache_alignment	: 128
address sizes	: 36 bits physical, 48 bits virtual
power management:


/proc/meminfo:
MemTotal:      2055920 kB
MemFree:       1046968 kB
Buffers:         41016 kB
Cached:         580360 kB
SwapCached:       2184 kB
Active:         477648 kB
Inactive:       461276 kB
HighTotal:           0 kB
HighFree:            0 kB
LowTotal:      2055920 kB
LowFree:       1046968 kB
SwapTotal:     4192956 kB
SwapFree:      4184640 kB
Dirty:           21588 kB
Writeback:           0 kB
Mapped:         325736 kB
Slab:            54868 kB
Committed_AS:  5830124 kB
PageTables:       2332 kB
VmallocTotal: 536870911 kB
VmallocUsed:    264952 kB
VmallocChunk: 536605679 kB
HugePages_Total:     0
HugePages_Free:      0
Hugepagesize:     2048 kB

Free space on /tmp at the end= 66542 MiB



End job on node35 on Sat Jan 20 15:26:20 UTC 2007