Call for Solvers and Benchmarks
Third International CSP Solver Competition
(CSP, Max-CSP and Weighted-CSP competition)
`http://cpai.ucc.ie/08/`

The third international CSP solver competition is organised to improve our knowledge of what is behind the efficiency of algorithms, heuristics, solving strategies and constraint systems. In the first edition (2005), only constraints represented in extension were authorized whereas in the second edition (2006) both constraints represented in extension and in intension were allowed (as well as the global constraint allDifferent). In 2006, the Max-CSP problem was also considered. With this third edition, we hope to reach maturity by introducing more global constraints and collecting both a broader set of instances and a higher number of contestants¹. For this new edition, the Weighted-CSP (WCSP) problem is also introduced. As a summary, the 2008 competition will consider the CSP, Max-CSP and WCSP problems, constraints defined in extension, intension or corresponding to some of the most used global constraints.

To participate to the competition, it is not necessary to submit a solver which can deal with all categories of constraints and all problem formats. Submitting a solver which is capable of dealing with only one kind of constraint (e.g. binary, extensional constraints) and one kind of CSP (e.g. ordinary CSP) is also allowed. The only requirement is to register solvers to the categories they are able to handle.

This call for solvers and benchmarks presents the problems and categories that will be considered during the competition. In particular, details about the representation of CSP instances, the execution environment, as well as the rules that solvers must follow, are given.

The results of the competition will be discussed at one of the CP'08 workshops. At the workshop, we will also discuss the organisation of the next edition of the competition.

1 Timetable

The deadlines of the competition are defined below:

Opening of the registration site at
`http://www.cril.univ-artois.fr/CPAI08`	May 2008
Pre-registration of contestants	May 20, 2008
Final registration (submission of solvers and benchmarks)	May 30, 2008
Test of solvers conformance	June 2008
Competition running	Summer 2008
Final results available	during CP 2008

Once submitted, solvers will be run on a limited number of benchmarks to make sure that they interact correctly with the evaluation environment. Potential problems will be reported to the authors in June 2008. Bug fixes will be due by the end of June 2008.

2 Problems and Categories

Anyone can submit a solver to any particular instance and solver category with respect to a general problem (CSP, Max-CSP or WCSP). For example, it is possible to just register a complete solver to the 2-ARY-EXT category (i.e. to deal with instances only involving binary constraints in extension) for CSP. In other words, contestants can enter the competition for different problems, instance categories and solver categories.

2.1 Problems

There are three problems:

CSP
Max-CSP
WCSP

The objective for CSP is finding a solution or proving that no solution exists. The objective for Max-CSP is finding an instantiation of variables that violates as few constraints as possible (or, equivalently, that satisfies as many constraints as possible). The objective for WCSP is finding an instantiation of variables that corresponds to the minimal violation cost (a cost or weight is associated with each tuple of each constraint). CSP is a decision problem whereas Max-CSP and WCSP are optimization ones.

2.2 Instance Categories

For all problems (CSP, Max-CSP and WCSP), the instance categories are

2-ARY-EXT:: instances only involving binary (and unary) constraints in extension.
N-ARY-EXT:: instances involving constraints in extension. At least one constraint has an arity strictly greater than .
2-ARY-INT:: instances involving binary (and unary) constraints in intension. Binary (and unary) constraints in extension can also be involved but no global constraint can be involved. At least, one involved constraint is in intension.
N-ARY-INT:: instances involving constraints in intension. Constraints in extension can also be involved but no global constraint can be involved. At least, one involved constraint is in intension and at least one involved constraint has an arity strictly greater than .
GLB:: instances involving any kind of constraints, including global constraints. A solver that doesn't have support for one kind of global constraints should just answer ``UNKNOWN'' as soon as it reads the unsupported constraint.

The GLB category will not be used for the WCSP competition.

2.3 Solver Categories

The solver categories are:

complete
incomplete

Complete solvers can determine if an instance is satisfiable or not (or find and prove the optimum for Max-CSP and WCSP) whereas incomplete solvers cannot prove the unsatisfiability or the optimum.

3 Resources

3.1 Format

The description of the format, called XCSP 2.1, used to represent CSP instances is now available. It can be found at http://cpai.ucc.ie/08/ or http://www.cril.univ-artois.fr/CPAI08/.

3.2 Benchmarks

Many benchmarks (including those used for the 2006 CSP Solver Competition) may be found at:

http://www.cril.univ-artois.fr/~lecoutre/research/benchmarks/

The organisers invite anybody to submit new benchmarks. The organisers are particularly interested in new problem instances originating from real-world applications.

3.3 Tools

Some tools are also provided. They can be found from http://cpai.ucc.ie/08/.

C++ and C parsers for the XML format can be found at:

http://www.cril.univ-artois.fr/~roussel/CSP-XML-parser

A Java parser for the XML format can be found at:

http://www.cril.univ-artois.fr/~lecoutre/research/tools

Other tools that are currently available from the last URL given above are:

: a tool that allows to compute the number of constraints violated by a full instantiation of the variables of a CSP instance.
: a tool that allows to check the validity of CSP instances (in format XCSP 2.1) and to convert in extension constraints defined in intension.
: a tool that allows to shuffle variables and constraints of CSP instances.

New tools that have been developed (by Andrea Rendl) are:

: a converter from XCSP to Essence'.
: a converter from XCSP/Essence' to Minion.
: a converter from Essence' to XCSP.

4 Execution Environment

Solvers will run on a cluster of computers using the Linux operating system. They will run under the control of another program (runsolver) which will enforce some limits on the memory and the total CPU time used by the program. Solvers will be run inside a sandbox that will prevent unauthorized use of the system (network connections, file creation outside the allowed directory, among others). All solvers will be run as a 32 bits application. Therefore, if you submit an executable, you are required to provide us with a 32 bit ELF executable (preferably statically linked). Solvers submitted in source form will be compiled on a 32 bit platform. Two executions of a solver with the same parameters and system resources must output the same result in approximately the same time (so that the experiments can be repeated).

During the submission process, you will be asked to provide the organizers with a suggested command line that should be used to run your solver. In this command line, you will be asked to use the following placeholders, which will be replaced by the actual information given by the evaluation environment.

BENCHNAME will be replaced by the name of the file containing the instance to solve (including the path to the file). Obviously, the solver must use this parameter (or alternatively BENCHNAMENOEXT).
BENCHNAMENOEXT will be replaced by the base name of the file containing the instance to solve (i.e. without the filename extension, but with the complete path to the file).
RANDOMSEED will be replaced by a random seed which is a number between and . This parameter MUST be used to initialize the random number generator when the solver uses random numbers. It is recorded by the evaluation environment and will allow to run the program on a given instance under the same conditions if necessary.
TIMEOUT represents the total CPU time (in seconds) that the solver may use before being killed. May be used to adapt the solver strategy.
MEMLIMIT represents the total amount of memory (in MiB) that the solver may use before being killed. May be used to adapt the solver strategy.
TMPDIR is the name of the only directory where the solver is allowed to read/write temporary files.
DIR is the name of the directory where the solver files will be stored.

Examples of command lines:

  DIR/mysolver BENCHNAME RANDOMSEED
  DIR/mysolver --mem-limit=MEMLIMIT --time-limit=TIMELIMIT --tmpdir=TMPDIR BENCHNAME
  java -jar DIR/mysolver.jar -c DIR/mysolver.conf BENCHNAME

As an example, these command lines could be expanded by the evaluation environment as:

  /solver10/mysolver file.pb 1720968
  /solver10/mysolver --mem-limit=900 --time-limit=1200 --tmpdir=/tmp/job12345 file.pb
  java -jar /solver10/mysolver.jar -c /solver10/mysolver.conf file.pb

The command line provided by the submitter is only a suggested command line. Organizers may have to modify this command line (e.g. memory limits of the Java Virtual Machine (JVM) may have to be modified to cope with the actual memory limits).

The solver may also (optionally) use the values of the following environment variables:

TIMEOUT (the number of seconds it will be allowed to run)
MEMLIMIT (the amount of RAM in MiB available to the solver)
TMPDIR (the absolute pathname of the only directory where the solver is allowed to create temporary files)

After TIMEOUT seconds have elapsed, the solver will first receive a SIGTERM to give it a chance to output the best solution found so far (in the case of an optimizing solver). One second later, the program will receive a SIGKILL signal from the controlling program to terminate the solver.

Similarly, a solver that uses more memory than the limit defined by MEMLIMIT will be sent a SIGTERM followed one second later by a SIGKILL.

The time and memory limits will be defined in accordance with the number of solvers and benchmarks that will enter the competition. One should expect a time limit of at least 20 minutes and a memory limit of at least 900 MiB.

The solver cannot write to any file except standard output, standard error and files in the TMPDIR directory. A solver is not allowed to open any network connection or launch external commands which are not related to the solving process. Solvers can use several processes or threads.

5 Output Rules

The evaluation environment records everything that is output by your solver on stdout/stderr (up to a limit of 1MiB) and is able to timestamp each line. This can be very informative to check how your solvers behaved on some instances.

Therefore solvers must output messages to the standard output and those messages will be used to check the results. The output format is inspired by the DIMACS output specification of the SAT competition and can be used to manually check some results. Lines output by the solver should be prefixed by "c ","s ","v ", "d " or "o ". Lines which do not start with one of these prefixes are considered as comment lines and are ignored. The meaning of these prefixes is detailed below.

5.1 Lines

There exist 5 different types of lines. They are defined as follows:

solution ("s " line)
These lines are mandatory and start with the two following characters: lower case s followed by a space (ASCII code 32). These two characters are followed by one of the following answers:
- SATISFIABLE (used for CSP, Max-CSP and WCSP)
- UNSATISFIABLE (used for CSP)
- UNKNOWN (used for CSP, Max-CSP and WCSP)
- OPTIMUM FOUND (used for Max-CSP and WCSP)
It is of uttermost importance to respect the exact spelling of these answers. Any mistake in the writing of these lines will cause the answer to be disregarded. Solvers are not required to provide any specific exit code corresponding to their answer.
See Subsections 5.2 and 5.3 about their use.
values ("v " line)
These lines are mandatory and start with the two following characters: lower case v followed by a space (ASCII code 32) and followed by a solution of the problem.
Here, a solution is a sequence of numbers (only), which are separated by one or more space symbols. The $i^{th}$ number in the solution corresponds to the ``assignment'' to the $i^{th}$ variable of the problem in the original problem specification in standard input format. (Note that this may have consequences for solvers that introduce auxiliary variables, rename variables, or change their order.)
diagnostic ("d " line)
These lines are optional and start with the two following characters: lower case d followed by a space (ASCII code 32). Then, a keyword followed by a value must be given on this line. See section 5.4 for details about the diagnostics.
comment ("c " line)
Such lines are optional and start with the two following characters: lower case c followed by a space (ASCII code 32). These lines are optional and may appear anywhere in the solver output. They contain any information that authors want to output. They are recorded by the evaluation environment for later viewing but are otherwise ignored. At most one megabyte of solver output will be recorded. So, if a solver is very verbose, some comments may be lost.
Submitters are advised to avoid outputting comment lines which may be useful in an interactive environment but otherwise useless in a batch environment. For example, outputting comment lines with the number of constraints read so far only increases the size of the logs with no benefit.
If a solver is really too verbose, the organizers will ask the submitter to remove some comment lines.
objective cost ("o " line) (for Max-CSP and WCSP, only)
These lines start with the two following characters: lower case o followed by a space (ASCII code 32). These two characters are followed by one integer.
See Subsection 5.3 about its use.

Important: Any line must be terminated by a Line Feed character: the usual Unix line terminator ' $\backslash$ n' (ASCII code 10). A "v " line which does not end with that terminator will be ignored because it will be considered that the solver was interrupted before it could output a complete solution. Also, it is important that you don't forget to flush the output as soon as you have printed a "s " line or a "v " line.

5.2 Specific rules for CSP Solvers

A CSP solver must output exactly one "s " line (it is mandatory) and in addition, when the instance is found SATISFIABLE, exactly one "v " line. These lines are not necessary the first ones in the output since the CSP solver can output some "c " and "d " lines in any order. For a CSP solver, the "s " line must correspond to one of the following answers:

s SATISFIABLE : when the solver has found a solution
s UNSATISFIABLE : when the solver can prove that the instance has no solution
s UNKNOWN : when the solver is not able to tell anything about the instance

If the solver does not output a "s " line, or if the "s " line is misspelled, then UNKNOWN will be assumed. For a CSP solver, the "v " line provides a valuation of each variable that satisfies all constraints. This will be used to check the correctness of the answer.

5.3 Specific rules for Max-CSP and WCSP solvers

Since a MAX-CSP or WCSP solver will not stop as soon as it finds a solution but instead will try to find a better solution, it must be given a way to output the best solution it found even when it reaches the time limit. There are two options depending on your ability to intercept signals. The first option may be used if your solver is able to catch the signal SIGTERM which will be sent by the evaluation environment at the time out and just one second before the solver is actually killed by a SIGKILL. This is the preferred option. The second option should be used when the first option is inapplicable.

You can intercept signals:
Whenever your Max-CSP or WCSP solver finds a better solution (or simply the first one) during search, it can output an "o " line with the current cost (number of unsatisfied constraints for Max-CSP) (these lines are not mandatory, but it is strongly advised to output them). Important: For Max-CSP, this is the number of unsatisfied constraints that must be output and not the number of satisfied constraints (as in 2006). Then, if your solver finds an optimal solution and proves its optimality (i.e., it can prove that no other assignment of the variables will provide a lower cost than this one) then it must output a "s " line with OPTIMUM FOUND, followed by a "v " line containing the optimal solution. If your solver is interrupted, then it must output a "s " line with SATISFIABLE, followed by a "v " line containing the best found solution (keep in mind that you have only one second to do this).
This option saves some time as the solver avoids to output a certificate for each solution it found. It only outputs a certificate for the best solution which it was able to find.
Examples of C/C++ code to intercept the SIGTERM signal are available on the PB06 web site: http://www.cril.univ-artois.fr/PB06/coding.html. Although this page gives code which is not directly suitable for the CSP competition, it should be straightforward to adapt it.
If you can't (or don't want to) catch the SIGTERM signal:
Then all you have to do is to output a "s " line with SATISFIABLE when the first solution is found, and a certificate "v " line each time you find a solution which is better than the previous ones accompanied (this is mandatory) with an "o " line. Only the last complete certificate will be taken into account. If eventually, your solver proves that the last solution that was output is optimal, then it must output "s OPTIMUM FOUND".

Example: a Max-CSP solver (that can intercept signal) first finds a solution with unsatisfied constraints, then finds another solution with unsatisfied constraints, and finally, a solution with unsatisfied constraint. Some time later, it proves that no better solution exists and it outputs "s OPTIMUM FOUND" followed by the solution which only violates 1 constraint. The output of this solver will be:

      o 19
      o 16
      o 1
      s OPTIMUM FOUND
      v 1 4 7 8 3 4

The evaluation environment will automatically timestamp each of these lines so that it is possible to know when the solver has found a better solution and the cost of the solution. The goal is to analyse how solvers progress toward the best solution. The timestamped output will be for example:

      0.57   o 19
      1.23   o 16
      2.7    o 1
      10.5   s OPTIMUM FOUND
      10.51  v 1 4 7 8 3 4

The first column in this example is the time at which the line was output by the solver (expressed in seconds of wall clock time since the beginning of the program).

A solver which doesn't intercept the SIGTERM signal may output for the same problem

      c Got a first solution !
      s SATISFIABLE
      o 19
      v 1 1 1 1 1 1
      c Found a better solution
      o 16
      v 1 2 1 1 1 1
      c Found a better solution
      o 1
      v 1 4 7 8 3 4
      s OPTIMUM FOUND

5.4 Diagnostics

A diagnostic is a (name,value) pair which describes the work carried out by the solver. They have to be written to stdout as a "d " line. Each diagnostic is a line of the form d NAME value, where NAME is a sequence of letters describing the diagnostic, and value is a sequence of characters defining the its value. The following diagnostics are predefined:

CHECKS:: The total number of consistency checks which have been carried out.
SAC_CHECKS:: The total number of singleton arc consistency checks which have been carried out.
ASSIGNMENTS:: The total number of assignments in the search tree.

Contestants wishing to record other diagnostics than the ones listed before should inform Marc van Dongen (dongen@cs.ucc.ie) about the names and nature of the diagnostics.

6 Entering the Competition

For CSP, Max-CSP and WCSP, the contestants can enter the competition with one or two solvers per solver categories (complete/incomplete). This means that a contestant may submit up to 12 solvers in total. Contestants are expected to submit their solver(s) and contribute some instances (as many instances as wished). Submitted instances will be made available on the evaluation web site shortly after the actual beginning of the competition. We cannot accept benchmarks which cannot (for various reasons) be publicly available (because anyone must be able to reproduce the experiments of the competition). Each contestant will have the possibility to select 25 instances (that can be kept hidden) which will be guaranteed to be used for the competition. They will also have to submit a position paper (at least 3 pages) in a second stage.

We expect that contestants propose solvers that recognize the XML format XCSP 2.1 (either natively or by embedding a conversion procedure). Contestants whose solver cannot be interfaced to read the XML format must get in touch with the organizers.

The deadline for submitting both benchmarks and solvers is May 30, 2008. Submission of solvers and benchmarks will be available online in May 2008 at http://www.cril.univ-artois.fr/CPAI08/.

7 Ranking

For each problem (CSP, Max-CSP and WCSP) and each combination of instance and solver category, there will be a ranking of solvers. The categories are described in Section 2. The main criteria for ranking the solvers are as follows.

CSP:

Solvers claiming incorrect results in a given category will be disqualified from this category. Of the remaining solvers, the solver solving the most problems will be declared the winner. Ties will be broken by considering the minimum total solution time.

Max-CSP/WCSP:

Solvers claiming incorrect results in a given category will be disqualified from this category. The remaining solvers will be ranked according to the following:

Incomplete solvers: will be ranked in increasing order of their average relative error (i.e. the average normalized difference between the cost of solutions found by the solver and the cost of the best known solutions).
Complete solvers: will be ranked in decreasing order of the number of instances for which a solution has been proved to be OPTIMUM. To break ties, we will rank the solvers in increasing order of their average relative errors.

8 Competition committees

8.1 Organizing committee

The following people are in charge of running the competition. They can be reached at cspcomp <at> cril.univ-artois.fr .

Marc VAN DONGEN, dongen <at> cs.ucc.ie ,
University College Cork, Ireland.
Christophe LECOUTRE, lecoutre <at> cril.fr ,
Centre de Recherches en Informatique de Lens, Université d'Artois, France.
Olivier ROUSSEL, olivier.roussel <at> cril.univ-artois.fr ,
Centre de Recherches en Informatique de Lens, Université d'Artois, France.

8.2 Judges

Three judges are in charge of taking decisions when rules are unclear and validating the results of the competition. The selection of instances used in the competition will be devoted to an independent committee nominated by the judges. The judges can be reached at cspcomp-jury <at> cril.univ-artois.fr . They are:

Pierre FLENER, Pierre.Flener <at> it.uu.se ,
Uppsala University, Sweden
Rick WALLACE, r.wallace <at> 4c.ucc.ie ,
Cork Constraint Computation Centre (4C), Ireland

Roland YAP who was initially member of the jury chose to resign on May, 28 when he decided to submit a solver.

8.3 Working Group

A working group in charge of generating/collecting instances and developing tools has been formed. It can be reached at cspcomp-wg <at> cril.univ-artois.fr . Currently, people of this working group are:

Emmanuel HEBRARD, e.hebrard <at> 4c.ucc.ie ,
Cork Constraint Computation Centre (4C), Ireland
Barry O'SULLIVAN, b.osullivan <at> cs.ucc.ie ,
Cork Constraint Computation Centre (4C), Ireland
Andrea RENDL, andrea <at> cs.st-andrews.ac.uk ,
School of Computer Science, University of St Andrews, UK
Sebastien TABARY, tabary <at> cril.fr ,
Centre de Recherches en Informatique de Lens, Université d'Artois, France

There is one rule about the membership of this working group: "Someone in this working group who also decides to participate to the 2008 competition must make public (one month before the deadline of the competition) any instance that he has developed/generated in the context of this working group." Also, note that anybody interested in joining the working group is welcome.

Footnotes

... contestants ¹: Recall that, in 2005, 15 CSP solvers proposed by 10 teams were entering the competition and run against a suite of 1064 instances, and that, in 2006, 23 CSP solvers proposed by 12 teams were entering (the second round of) the competition and run against a suite of 3425 instances (also, 9 Max-CSP solvers proposed by 5 teams were entering the competition and run against a suite of 1069 instances).