Difference between revisions of "Implemented Problem Format"

Latest revision as of 02:17, 8 January 2013

Overview

This document describes a standard for distributing and sharing problems for algorithmic programming contests. It is primarily designed for the ACM ICPC but is applicable also to other contests.

All problems must have a "short name" consisting solely of lower case letters a-z and digits 0-9. All files related to a given problem are provided in a directory named after the short name of the problem.

All source code file names must match the following regexp

[a-zA-Z0-9][a-zA-Z0-9_.-]*[a-zA-Z0-9]

I.e., it must be of length at least 2, consist solely of lower or upper case letters a-z, A-Z, digits 0-9, period, dash or underscore, but must not begin or end with period, dash or underscore.

All text files for a problem should be UTF-8 encoded.

Problem Metadata

Metadata about the problem (e.g., source, license, limits) are provided in a UTF-8 encoded YAML file named <short-name>/problem.yaml.

The keys are defined as below. All keys are optional except rights_owner. Any unknown keys should be ignored.

Key	Type	Default	Comments
author	String or sequence of strings	Unknown	Who should get author credits. This would typically be the people that came up with the idea, wrote the problem specification and created the test data. This is sometimes omitted when authors choose to instead only give source credit, but both may be specified.
source	String		Who should get source credit. This would typically be the name (and year) of the event where the problem was first used or created for.
license	String	cc by-sa	License under which the problem may be used. Value have to be one of the ones defined below.
rights_owner	String	[mandatory]	Owner of the copyright of the problem. If not present, author is owner.
limits	Map with keys as defined below	see definition below
validator	String		set of values from below (delimited by spaces)

limits

A map with the following keys:

Key	Comments	Default
time_multiplier	optional	5
time_safety_margin	optional	2
memory	optional, in Mb	2048
output	optional, in Mb	8
compilation_time	optional, in seconds	60
validation_time	optional, in seconds	60
validation_memory	optional, in Mb	2048
validation_output	optional, in Mb	8

validator

A space separated list from the following:

Value	Comments
case_sensitive	upper/lower case differences are significant. If this parameter is not specified, any changes in case are to be ignored.
space_change_sensitive	whitespace differences are significant. If this parameter is not specified, any non-zero amounts of whitespace are considered identical.
float_relative_tolerance X	accepts token if it is a floating point number and the relative error is <= X
float_absolute_tolerance X	accepts token if it is a floating point number and the absolute error is <= X
float_tolerance X	accepts token if either "float_relative_tolerance X" or "float_absolute_tolerance X" would accept
custom	use a custom output validator. Must be the first value. All following values will be passed as command-line arguments to each of the output validators

Problem Statements

The problem statement of the problem is provided in the directory <short_name>/problem_statement/.

This directory must contain one LaTeX file per language, named problem.<language>.tex, that contains the problem text itself, including input and output specifications, but not sample input and output. Language must be given as an ISO 639-1 alpha-2 language code. Optionally, the language code can be left out, the default is then English.

A template will be provided that \imports this file as well as the sample input and output. The format of the problem statement is described in Problem Statement Template. Files needed by this file must all be in <short_name>/problem_statement/ , problem.tex should reference auxiliary files as if the working directory is <short_name>/problem_statement/.

Test data

The test data are provided in subdirectories of <short_name>/data/. The sample data in <short_name>/data/sample/ and the secret data in <short_name>/data/secret/.

All input and answer files have the filename extension .in and .ans respectively. Optionally descriptions of a data set (a matching pair of .in and .ans files) may be provided, as follows:

A text file with filename extension .desc, which is only shown to judges of the judging system.
A text file with filename extension .hint, which is shown to all users.

The .desc file is meant to be privileged information, whereas the .hint file is meant to be given to anybody who needs it, i.e. fails to solve the problem. The hint file might not be used at all, depending on how the problem is used, e.g. when used in a programming contest.

Input, answer, and description files are matched by the base name.

Test files will be used in lexicographical order. If a specific order is needed a numbered prefix such as 00, 01, 02, 03, and so on, can be used.

Example Submissions

Correct and incorrect solutions to the problem are provided in a directory <short_name>/submissions/. The submissions/ directory has subdirectories corresponding to the judgement that the programs are supposed to receive:

submissions/accepted/: Solutions correctly solving the problem
submissions/wrong_answer/: Submissions producing incorrect answers
submissions/time_limit_exceeded/: Submissions that are supposed to be too slow
submissions/run_time_error/: Submissions that are supposed to crash

Every file or directory in these directories represents a separate solution. Same requirements as for submissions with regards to filenames. It is mandatory to provide at least one accepted solution. Everything else is optional, and empty subdirectories can be omitted.

Input/Output Methods

Submissions must operate in such a way that they receive problem input data on their standard input stream, and write to their standard output stream.

Included Code

Code that should be included with all submissions are provided in one directory per supported language, called <short_name>/include/<language>/.

The files should be copied from a language directory based on the language of the submission, to the submission files before compiling, overwriting files from the submission in the case of name collision. Language must be given as one of the language codes in the table below. If any of the included files are supposed to be the main file (i.e. a driver), that file must have the langage dependent name as given in the table below.

Code	Language	Default main file
c	C
cpp	C++
java	Java	Main.java
python	Python	main.py

Validators

Input Format Validators

Input Format Validators, for verifying the correctness of the input files, are provided in <short_name>/input_format_validators/. They must adhere to the Input format validator standard.

Output Validators

Output Validators are used if the problem requires more complicated output validation than what is provided by the default diff variant described below. They are provided in <short_name>/output_validators/, and must adhere to the output validator standard.

File Conventions For Validators

A validator is either a file or a directory. A validator in the form of a directory may include two scripts "build" and "run". Either both or none of these scripts must be included. If the scripts are present, then:

validator must be compiled by executing the build script.
the validator must be run by executing the run script.

Otherwise, the validator will be compiled and run as if it was a submission (except that it is given the command-line arguments specified in problem.yaml, if there are any).

Default Validator Capabilities

The default validator is essentially a beefed-up diff. In its default mode, it tokenizes the files to compare and compares them token by token. It supports the following command-line arguments to control how tokens are compared.

Arguments	Description
`case_sensitive`	indicates that comparisons should be case-sensitive.
`space_change_sensitive`	indicates that changes in the amount of whitespace should be rejected (the default is that any sequence of 1 or more whitespace characters are equivalent).
`float_relative_tolerance ε`	indicates that floating-point tokens should be accepted if they are within relative error ≤ ε (see below for details).
`float_absolute_tolerance ε`	indicates that floating-point tokens should be accepted if they are within absolute error ≤ ε (see below for details).
`float_tolerance ε`	short-hand for applying ε as both relative and absolute tolerance.

When supplying both a relative and an absolute tolerance, the semantics are that a token is accepted if it is within either of the two tolerances. When a floating-point tolerance has been set, any valid formatting of floating point numbers is accepted for floating point tokens. So for instance if a token in the answer file says 0.0314, a token of 3.14000000e-2 in the output file would be accepted. If no floating point tolerance has been set, floating point tokens are treated just like any other token and has to match exactly.

Determination of Time Limit

The execution time limit for the problem is determined as follows. First, all the provided accepted/ solutions are run. Let t_max be the maximum running time of these solutions on any of the input files. The time limit is then set to be t_lim = ⌈t_max · M⌉ where M is the value of the time multiplier parameter from the limits configuration of problem.yaml.

Furthermore, it is required that all of the provided time limit exceeded submissions run for at least t_lim · S seconds, where S is the value of the time safety margin parameter from the limits configuration.

Verification

Solutions or validators in languages that are not supported by the CCS should be ignored and a warning to that effect shown.

Verification Checks (in order)

Check files (all files present as required + check problem.yaml)
Check compile (check that all programs compile)
Check input (run input validators)
Check solutions (run all solutions check that they get the expected verdicts)

Warn if:
      there are no *.in in data/sample/

Error if:
      there is no problem.yaml
      there is no problem statement (i.e., a problem*.tex file)
      any value in problem.yaml is invalid
      there are no *.in in data/secret/
      there are .in files without corresponding .ans files in data/*/
      there are .ans files without corresponding .in files in data/*/
      there are no solutions in submissions/accepted/
      there are no validators in input_format_validators/
      validator begins with "custom" and there are no validators in output_validators/
      there are validators in output_validators/ and validator does not begin with "custom"
      any validator (input format or output) does not compile

For each *.in in public_data and judge_data:
      For each validator in input_format_validators/:
              If the validator does not accept the input file: Error!

For each solution in test_submissions/accepted/:
      For each *.in in data/*/:
              Run the solution on the input
              For the built-in validator if corrector is "diff" or each validator in output_validators/:
                      If the validator does not accept the output of the solution: Error!
Let t be the longest time any of the solutions ran on any of the inputs.

For each solution in submissions/time_limit_exceeded/:
      For each *.in in data/*/:
              Run the solution on the input for at least t * time_limit_safety_margin seconds.
Let t_slow be the shortest time any of the solutions ran on any of the inputs.

If t_slow is less than t * time_limit_safety_margin: Error!

For each solution in submissions/wrong_answer/:
      For each *.in in data/*/:
              Run the solution on the input
              For the built-in validator if corrector = "diff" or each validator in output_validators/:
                      If the validator accepts the output of the solution: Error!

For each solution in submissions/run_time_error/:
      For each *.in in data/*/:
              Run the solution on the input
              If the solution is not judged Run-Time Error: Error!

Appendix

Configuration file format

problem.yaml is a UTF-8 encoded YAML file consisting of a mapping with the following keys:

Key	Comments	Example
source	optional	ICPC World Finals 2011
author	optional, defaults to "Unknown"
license	optional, defaults to "cc by-sa"
rights_owner	mandatory	ICPC
keywords	optional
difficulty	optional
limits	Sequence of mappings with keys as defined below
validator	optional, set of values from the following (delimited by spaces): "case_sensitive" - upper/lower case differences are significant. If this parameter is not specified, any changes in case are to be ignored. "space_change_sensitive" - whitespace differences are significant. If this parameter is not specified, any non-zero amounts of whitespace are considered identical. "float_relative_tolerance X" - accepts token if it is a floating point number and the relative error is <= X "float_absolute_tolerance X" - accepts token if it is a floating point number and the absolute error is <= X "float_tolerance X" - accepts token if either "float_relative_tolerance X" or "float_absolute_tolerance X" would accept "custom" - use a custom output validator. Must be the first value. All following values will be passed as command-line arguments to each of the output validators

limits

A sequence of mappings with the following keys:

Key	Comments	Example
time_multiplier	optional, defaults to 5	3.5
time_safety_margin	optional, defaults to 2	1.5
memory	optional, in Mb, defaults to 2048	3072
output	optional, in Mb, defaults to 8	4
compilation_time	optional, in seconds, defaults to 60	120
validation_time	optional, in seconds, defaults to 60	120
validation_memory	optional, in Mb, defaults to 2048	3072
validation_output	optional, in Mb, defaults to 8	4

Sample problem.yaml

Typical problem.yaml:

# Problem configuration
source: ICPC Mid-Atlantic Regional Contest
author: John von Judge 
rights_owner: ICPC

Maximal problem.yaml:

# Problem configuration
source: ICPC Mid-Atlantic Regional Contest
author: John von Judge 
license: cc by-sa  
rights_owner: ICPC

limits:
   time_multiplier: 5
   time_safety_margin: 2
   memory: 4096  
   output: 16 
   compilation_time: 240
   validation_time: 240
   validation_memory: 3072
   validation_output: 4

validator: space_change_sensitive float_absolute_tolerance 1e-6

Directory structure

<short_name>/
      problem.yaml - problem configuration file
      problem_statement/
              problem.tex - problem statement
              - any files that problem.tex needs to include, e.g. images
      data/
              sample/
                      *.in - sample input files
                      *.ans - sample answer files
              secret/
                      *.in - input files
                      *.ans - answer files
                      *.txt - optional data file description
      submissions/
              accepted/
                      - single file or directory per solution
              wrong_answer/
                      - single file or directory per solution
              time_limit_exceeded/
                      - single file or directory per solution
              run_time_error/
                      - single file or directory per solution
      input_format_validators/
              - single file or directory per validator
      output_validators/
              - single file or directory per validator

Sample Directory / Filenames

This is a sample list of directories/files for a problem named squares

squares/problem.yaml
squares/problem_statement/problem.en.tex
squares/problem_statement/problem.sv.tex
squares/problem_statement/square1.png
squares/problem_statement/square2.png
squares/data/sample/squares_sample1.in 
squares/data/sample/squares_sample1.ans
squares/data/sample/squares_sample2.in 
squares/data/sample/squares_sample2.ans
squares/data/secret/squares1.in 
squares/data/secret/squares1.ans
squares/data/secret/squares1.txt
squares/data/secret/squares2_cornercases.in 
squares/data/secret/squares2_cornercases.ans
squares/data/secret/squares3_bigcases.in
squares/data/secret/squares3_bigcases.ans
squares/submissions/accepted/squares.cpp
squares/submissions/accepted/Squares.java
squares/submissions/accepted/squares.c
squares/submissions/wrong_answer/wrong.cpp
squares/submissions/time_limit_exceeded/tle.c
squares/submissions/run_time_error/rte.c
squares/input_format_validators/squares_input_checker1.py
squares/input_format_validators/squares_input_checker2/check.c
squares/input_format_validators/squares_input_checker2/data.h
squares/output_validators/squares_validator/validator.f
squares/output_validators/squares_validator/build
squares/output_validators/squares_validator/run

@@ Line 124: / Line 124: @@
 Submissions must operate in such a way that they receive problem input data on their standard input stream, and write to their standard output stream.
+== Included Code ==
+Code that should be included with all submissions are provided in one directory per supported language, called <tt><short_name>/include/<language>/</tt>.
+The files should be copied from a language directory based on the language of the submission, to the submission files before compiling, overwriting files from the submission in the case of name collision. Language must be given as one of the language codes in the table below. If any of the included files are supposed to be the main file (i.e. a driver), that file must have the langage dependent name as given in the table below.
+{| class="wikitable"
+! Code !! Language !! Default main file
+|-
+| c || C ||
+|-
+| cpp || C++ ||
+|-
+| java || Java || Main.java
+|-
+| python || Python || main.py
+|}
 == Validators ==
@@ Line 129: / Line 148: @@
 === Input Format Validators ===
-Input Format Validators, for verifying the correctness of the input files, are provided in <tt><short_name>/input_format_validators/</tt>.  They must adhere to the [[Input format validator]] standard.
+Input Format Validators, for verifying the correctness of the input files, are provided in <tt><short_name>/input_format_validators/</tt>.  They must adhere to the [[Implemented input format validator|Input format validator]] standard.
 === Output Validators ===
-Output Validators are used if the problem requires more complicated output validation than what is provided by the default diff variant described below.  They are provided in <tt><short_name>/output_validators/</tt>, and must adhere to the [[Output validator| Output Validator]] standard.
+Output Validators are used if the problem requires more complicated output validation than what is provided by the default diff variant described below.  They are provided in <tt><short_name>/output_validators/</tt>, and must adhere to the [[Implemented output validator| output validator]] standard.
 === File Conventions For Validators ===

Difference between revisions of "Implemented Problem Format"

Latest revision as of 02:17, 8 January 2013

Contents

Overview

Problem Metadata

limits

validator

Problem Statements

Test data

Example Submissions

Input/Output Methods

Included Code

Validators

Input Format Validators

Output Validators

File Conventions For Validators

Default Validator Capabilities

Determination of Time Limit

Verification

Appendix

Configuration file format

limits

Sample problem.yaml

Directory structure

Sample Directory / Filenames

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools