Trace number 282748

Some explanations

A solver is run under the control of another program named runsolver. runsolver is in charge of imposing the CPU time limit and the memory limit to the solver. It also monitors some information about the process. The trace of the execution of a solver is divided into four (or five) parts:
  1. SOLVER DATA
    This is the output of the solver (stdout and stderr).
    Note that some very long lines in this section may be truncated by your web browser ! In such a case, you may want to use the "Download as text" link to get the trace as a text file.

    When the --timestamp option is passed to the runsolver program, each line output by the solver is prepended with a timestamp which indicates at what time the line was output by the solver. Times are relative to the start of the program, given in seconds, and are wall clock time (not CPU time).

    As some 'v lines' may be very long (sometimes several megabytes), the 'v line' output by your solver may be split on several lines to help limit the size of the trace recorded in the database. In any case, the exact output of your solver is preserved in a trace file.
  2. VERIFIER DATA
    The output of the solver is piped to a verifier program which will search a value line "v " and, if found, will check that the given interpretation satisfies all constraints.
  3. CONVERSION SCRIPT DATA (Optionnal)
    When a conversion script is used, this section shows the messages that were output by the conversion script.
  4. WATCHER DATA
    This is the informations gathered by the runsolver program. It first prints the different limits. There's a first limit on CPU time set to X seconds (see the parameters in the trace). After this time has ellapsed, runsolver sends a SIGTERM and 2 seconds later a SIGKILL to the solver. For safety, there's also another limit set to X+30 seconds which will send a SIGXPU to the solver. The last limit is on the virtual memory used by the process (see the parameters in the trace).
    Every ten seconds, the runsolver process fetches the content of /proc/loadavg, /proc/pid/stat and /proc/pid/statm (see man proc) and prints it as raw data. This is only recorded in case we need to investigate the behaviour of a solver. The memory used by the solver (vsize) is also given every ten seconds.
    When the solver exits, runsolver prints some informations such as status and time. CPU usage is the ratio CPU Time/Real Time.
  5. LAUNCHER DATA
    These informations are related to the script which will launch the solver. The most important informations are the command line given to the solver, the md5sum of the different files and the dump of the /proc/cpuinfo and /proc/meminfo which provides some useful information on the computer.

Solver answer on this benchmark

Solver NameAnswerCPU timeWall clock time
Toolbar_MaxSat 2007-01-19? (MO) 2.09 2.16016

General information on the benchmark

NameMaxCSP/celar/subs6/
scenw-6-sub3_ext.xml
MD5SUM532a0a01a58d627c1504ce0cf6a00535
Bench Category2-ARY-EXT (binary constraints in extension)
Best result obtained on this benchmarkMOPT
Best Number of satisfied constraints397
Best CPU time to get the best result obtained on this benchmark71.1462
Satisfiable
(Un)Satisfiability was proved
Number of variables18
Number of constraints421
Maximum constraint arity2
Maximum domain size44
Number of constraints which are defined in extension421
Number of constraints which are defined in intension0
Global constraints used (with number of constraints)

Solver Data (download as text)

c 
c 
c 
c conversion script
c 
c 

translate /tmp/evaluation/282748-1169322159/unknown.xml to /tmp/evaluation/282748-1169322159/unknown.wcsp

Verifier Data (download as text)

ERROR: no interpretation found !

Watcher Data (download as text)

runsolver version 3.1.3 (c) roussel@cril.univ-artois.fr

command line: runsolver --timestamp -w ROOT/results/node44/watcher-282748-1169322159 -o ROOT/results/node44/solver-282748-1169322159 -C 2400 -M 900 /tmp/evaluation/282748-1169322159/solver.sh /tmp/evaluation/282748-1169322159/unknown 

Enforcing CPUTime limit (soft limit, will send SIGTERM then SIGKILL): 2400 seconds
Enforcing CPUTime limit (hard limit, will send SIGXCPU): 2430 seconds
Enforcing VSIZE limit (soft limit, will send SIGTERM then SIGKILL): 921600 KiB
Enforcing VSIZE limit (hard limit, stack expansion will fail with SIGSEGV, brk() and mmap() will return ENOMEM): 972800 KiB
Current StackSize limit: 10240 KiB

/proc/loadavg: 1.00 0.98 1.02 5/73 24657
/proc/meminfo: memFree=1262472/2055920 swapFree=4191892/4192956
[pid=24656] ppid=24649 vsize=8564 CPUtime=0
/proc/24656/stat : 24656 (solver.sh) R 24649 24656 24507 0 -1 4194304 317 0 0 0 0 0 0 0 18 0 1 0 280615117 8769536 261 18446744073709551615 4194304 4520092 548682069344 18446744073709551615 264840642514 0 2 4096 8192 0 0 0 17 0 0 0
/proc/24656/statm: 2141 261 201 79 0 134 0

[startup+0.102938 s]
/proc/loadavg: 1.00 0.98 1.02 5/73 24657
/proc/meminfo: memFree=1262472/2055920 swapFree=4191892/4192956
[pid=24656] ppid=24649 vsize=47560 CPUtime=0.01
/proc/24656/stat : 24656 (solver.sh) S 24649 24656 24507 0 -1 4194304 871 2241 0 0 0 0 0 1 25 0 1 0 280615117 48701440 360 18446744073709551615 4194304 4520092 548682069344 18446744073709551615 264840080218 0 2 4096 73728 18446744071563181037 0 0 17 1 0 0
/proc/24656/statm: 11890 360 244 79 0 217 0
Current children cumulated CPU time (s) 0.01
Current children cumulated vsize (KiB) 47560

[startup+0.510979 s]
/proc/loadavg: 1.00 0.98 1.02 5/73 24657
/proc/meminfo: memFree=1262472/2055920 swapFree=4191892/4192956
[pid=24656] ppid=24649 vsize=47560 CPUtime=0.31
/proc/24656/stat : 24656 (solver.sh) S 24649 24656 24507 0 -1 4194304 899 2607 0 0 0 0 29 2 15 0 1 0 280615117 48701440 360 18446744073709551615 4194304 4520092 548682069344 18446744073709551615 264840080218 0 2 4096 73728 18446744071563181037 0 0 17 1 0 0
/proc/24656/statm: 11890 360 244 79 0 217 0
Current children cumulated CPU time (s) 0.31
Current children cumulated vsize (KiB) 47560

[startup+1.33207 s]
/proc/loadavg: 1.00 0.98 1.02 3/76 24683
/proc/meminfo: memFree=804208/2055920 swapFree=4191892/4192956
[pid=24656] ppid=24649 vsize=47560 CPUtime=0.31
/proc/24656/stat : 24656 (solver.sh) S 24649 24656 24507 0 -1 4194304 899 2607 0 0 0 0 29 2 15 0 1 0 280615117 48701440 360 18446744073709551615 4194304 4520092 548682069344 18446744073709551615 264840080218 0 2 4096 73728 18446744071563181037 0 0 17 1 0 0
/proc/24656/statm: 11890 360 244 79 0 217 0
[pid=24680] ppid=24656 vsize=783492 CPUtime=0.96
/proc/24680/stat : 24680 (toolbar) R 24656 24656 24507 0 -1 4194304 157416 2885 0 0 3 67 25 1 23 0 1 0 280615151 802295808 157382 18446744073709551615 134512640 135242688 4294956608 18446744073709551615 134580758 0 0 4096 0 0 0 0 17 1 0 0
/proc/24680/statm: 195873 157382 53 178 0 195691 0
Current children cumulated CPU time (s) 1.27
Current children cumulated vsize (KiB) 831052



Maximum VSize exceeded: sending SIGTERM then SIGKILL

[startup+2.14815 s]
/proc/loadavg: 1.00 0.98 1.02 3/76 24683
/proc/meminfo: memFree=388016/2055920 swapFree=4191892/4192956
[pid=24656] ppid=24649 vsize=47560 CPUtime=0.31
/proc/24656/stat : 24656 (solver.sh) S 24649 24656 24507 0 -1 4194304 899 2607 0 0 0 0 29 2 15 0 1 0 280615117 48701440 360 18446744073709551615 4194304 4520092 548682069344 18446744073709551615 264840080218 0 2 4096 73728 18446744071563181037 0 0 17 1 0 0
/proc/24656/statm: 11890 360 244 79 0 217 0
[pid=24680] ppid=24656 vsize=884824 CPUtime=1.78
/proc/24680/stat : 24680 (toolbar) R 24656 24656 24507 0 -1 4194304 216560 2885 0 0 59 93 25 1 25 0 1 0 280615151 906059776 216525 18446744073709551615 134512640 135242688 4294956608 18446744073709551615 134676026 0 0 4096 2 0 0 0 17 1 0 0
/proc/24680/statm: 221206 216525 70 178 0 221024 0
Current children cumulated CPU time (s) 2.09
Current children cumulated vsize (KiB) 932384

Sending SIGTERM to process tree (bottom up)
Sleeping 2 seconds

Child ended because it received signal 15 (SIGTERM)

!!! problem with CPU time !!!
wait4(...,&childrusage) returns 24656 and gives childrusage.ru_utime.tv_sec=0 childrusage.ru_utime.tv_usec=302953 childrusage.ru_stime.tv_sec=0 childrusage.ru_stime.tv_usec=29995
CPU time returned by wait4() is 0.332948
while last known CPU time is 2.09

Solver probably didn't/couldn't wait for its children
Using last known CPU time as value...

Real time (s): 2.16016
CPU time (s): 2.09
CPU user time (s): 1.13
CPU system time (s): 0.96
CPU usage (%): 96.7519
Max. virtual memory (cumulated for all children) (KiB): 932384

getrusage(RUSAGE_CHILDREN,...) data:
user time used= 0.302953
system time used= 0.029995
maximum resident set size= 0
integral shared memory size= 0
integral unshared data size= 0
integral unshared stack size= 0
page reclaims= 3506
page faults= 0
swaps= 0
block input operations= 0
block output operations= 0
messages sent= 0
messages received= 0
signals received= 0
voluntary context switches= 69
involuntary context switches= 32

runsolver used 0.001999 s user time and 0.016997 s system time

The end

Launcher Data (download as text)

Begin job on node44 on Sat Jan 20 19:42:39 UTC 2007


IDJOB= 282748
IDBENCH= 12885
IDSOLVER= 71
FILE ID= node44/282748-1169322159

PBS_JOBID= 3610440

Free space on /tmp= 66560 MiB

SOLVER NAME= Toolbar_MaxSat 2007-01-19
BENCH NAME= HOME/pub/bench/CPAI06/MaxCSP/celar/subs6/scenw-6-sub3_ext.xml
COMMAND LINE= /tmp/evaluation/282748-1169322159/solver.sh /tmp/evaluation/282748-1169322159/unknown
CONVERSION COMMAND LINE= runsolver -w ROOT/results/node44/convwatcher-282748-1169322159 -o ROOT/results/node44/conversion-282748-1169322159 -C 600 -M 900 /tmp/evaluation/282748-1169322159/translate /tmp/evaluation/282748-1169322159/unknown
CONVERSION RUNSOLVER STATUS CODE= 0
CONVERSION STATUS CODE= 0

RUNSOLVER COMMAND LINE= runsolver  --timestamp  -w ROOT/results/node44/watcher-282748-1169322159 -o ROOT/results/node44/solver-282748-1169322159 -C 2400 -M 900  /tmp/evaluation/282748-1169322159/solver.sh /tmp/evaluation/282748-1169322159/unknown

META MD5SUM SOLVER= f843f34905a307bcc0c6a322bc802c9d
MD5SUM BENCH=  532a0a01a58d627c1504ce0cf6a00535

RANDOM SEED= 746530601

TIME LIMIT= 2400 seconds

MEMORY LIMIT= 900 MiB


/proc/cpuinfo:
processor	: 0
vendor_id	: GenuineIntel
cpu family	: 15
model		: 4
model name	:                   Intel(R) Xeon(TM) CPU 3.00GHz
stepping	: 3
cpu MHz		: 3000.213
cache size	: 2048 KB
fpu		: yes
fpu_exception	: yes
cpuid level	: 5
wp		: yes
flags		: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm syscall nx lm pni monitor ds_cpl cid cx16 xtpr
bogomips	: 5914.62
clflush size	: 64
cache_alignment	: 128
address sizes	: 36 bits physical, 48 bits virtual
power management:

processor	: 1
vendor_id	: GenuineIntel
cpu family	: 15
model		: 4
model name	:                   Intel(R) Xeon(TM) CPU 3.00GHz
stepping	: 3
cpu MHz		: 3000.213
cache size	: 2048 KB
fpu		: yes
fpu_exception	: yes
cpuid level	: 5
wp		: yes
flags		: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm syscall nx lm pni monitor ds_cpl cid cx16 xtpr
bogomips	: 5586.94
clflush size	: 64
cache_alignment	: 128
address sizes	: 36 bits physical, 48 bits virtual
power management:


/proc/meminfo:
MemTotal:      2055920 kB
MemFree:       1262896 kB
Buffers:         57172 kB
Cached:         635684 kB
SwapCached:        268 kB
Active:         208088 kB
Inactive:       504992 kB
HighTotal:           0 kB
HighFree:            0 kB
LowTotal:      2055920 kB
LowFree:       1262896 kB
SwapTotal:     4192956 kB
SwapFree:      4191892 kB
Dirty:            8920 kB
Writeback:           0 kB
Mapped:          29948 kB
Slab:            65192 kB
Committed_AS:  5506868 kB
PageTables:       1740 kB
VmallocTotal: 536870911 kB
VmallocUsed:    264952 kB
VmallocChunk: 536605679 kB
HugePages_Total:     0
HugePages_Free:      0
Hugepagesize:     2048 kB

Free space on /tmp at the end= 66554 MiB



End job on node44 on Sat Jan 20 19:42:43 UTC 2007