Skip to content

Latest commit

 

History

History
145 lines (126 loc) · 5.5 KB

langs.md

File metadata and controls

145 lines (126 loc) · 5.5 KB

Information about Source Languages

Language Definitions

C0deine has a variety of source languages that build upon each other by adding new features. Here is a list of currently supported languages (as well as links to their definition/information about them):

  • L1 is a simple language that only allows for integer variables and straight-line code.
  • L2 expands on L1 by allowing for control-flow constructs (if statements, while-loops, etc.) as well as boolean variables and operations.
  • L3 adds the ability to declare and define other functions besides main. It also adds typedefs.
  • L4 finally reaches a usable programming language with pointers, structs, arrays.
  • C0 is the primary source language. It adds char, string types as well as contracts to reason about code. The original definition of C0 can be found here

For a general summary, here is a reprint of differences between C0 and C from 2.10.1

  • No unions
  • No casting
  • No pointer arithmetic
  • No sizeof operator
  • No address-of operator (&)
  • No storage modifiers (static, volatile, register, extern)
  • No labels, goto, switch statements or do ... while loops
  • No assignment in expressions
  • No floating point datatypes or literals
  • No complex types
  • No unsigned or other integral types besides int
  • Structure types may not be declared as local variables or used as return types for functions
  • No comma separated expressions
  • No explicit memory deallocation
  • Allocation is done using types not a size in bytes.
  • No fixed size arrays
  • No stack allocated arrays

Eventually, we intend to extend C0deine to support C1 (and potentially beyond) which addresses some of the limitations of C0.

Deviations

This implementation makes a few (known) deviations from the source languages defined above as well as some behaviour clarifications:

  • The keyword error from C0 is now reserved in all languages (not just C0).

  • The compiler #use directive from C0 can be used in L3/L4.

  • Currently C0deine doesn't run purity checks on contracts. Thus, contracts cannot be dynamically checked.

  • A bug in the cc0 reference compiler means some operations occur out of order when using assign-ops:

    int main() {
      int[] x = alloc_array(int, 0);
      x[0] += 1 / 0;
      return 0;
    }

    The reference semantics indicate this should be a memory error but the cc0 compiler produces a FPE. C0deine correctly raises a memory error.

  • The error statement is specified to "immediately terminate the program". Likewise, the reference semantics state that "Along each control-flow path in the body of each block in each function, each locally declared variable is initialized before its use." (C0.24). Thus, it follows that we should accept the following program (since error ends the control flow path):

    int main() {
      int y;
      if(false) {
        error("this terminates the program");
        return y;
      } else {
        return 0;
      }
    }

    The cc0 compiler incorrectly rejects this program (stating that y is uninitialised). C0deine correctly accepts this program.

  • In the C0 grammar, the definition for <spec> implies that new lines \n could be placed after the contract keyword or even within the expression. This causes issues when using the line annotations:

    int exp(int k, int n)
    //@requires n
          >= 0;
    // rest of program omitted...

    The grammar indicates that this is a valid program. C0deine (as does cc0) choose to reject this program.

  • In the semantics for annotations an "@assert cannot annotate functions" (C0.25). This is the only comment made about @assert, implying that they could procede loop bodies:

    int main(){
      int i = 0;
      while(i < 10)
      //@assert i <= 10;
      {
        i++;
      }
      return i;
    }

    It is unclear what the intended semantics of this would be, so C0deine rejects this program (as does cc0).

  • The grammar allows for ''' to be parsed as a valid char. C0deine accepts this (as does cc0) even if C compilers do not.

Standard Libraries

In order to be able to run and interface with generated code, C0deine requires additional functions. A C-style header file for these functions can be found in libs/c0deine.h.

Beyond this, C0 defines a few standard libraries here. In the future, we hope for C0deine to fully support all of these libraries in some capacity. For now, these are the only ones that are currently supported:

  • Input/Output
    • conio (except for the printf function)
  • Strings

The C0-implemented libraries should also work, but are currently not distributed here.

The C0deine library is already included with the outputted WASM code, but the other libraries can be used via the #use <lib> compiler directive. Libraries are only included once even if multiple files import them.

The example already provides necessary imports to make these libraries work fully.