handle long long type in TBAA #1241

PikachuHyA · 2024-12-19T01:57:55Z

The #1220 adds support for scalar types. However, it does not handle the long long type correctly; instead, long long is treated as long. Specifically, long long is represented as s64i in CIR, which is then mapped to the long type.

The text was updated successfully, but these errors were encountered:

Lancern · 2024-12-19T03:06:21Z

The problem here is that CIR integer types do not have a 1:1 correspondence to C/C++ integer types, so you could not tell whether a s64i come from long or long long or std::int64_t. On contrary, CIR floating point types have a 1:1 correspondence to C/C++ floating point types. long double would be lowered to !cir.long_double instead of !cir.double or !cir.fp80 depending on the target. Maybe we should do the same for integer types?

PikachuHyA · 2024-12-19T06:47:33Z

Maybe we should do the same for integer types?

That sounds like a great idea! I will attempt to add !cir.long_long<underlying>, similar to this:

def CIR_LongLong : CIR_Type<"LongLong", "long_long"> {
  let parameters = (ins "mlir::Type":$underlying);

  let assemblyFormat = [{
    `<` $underlying `>`
  }];
}

bcardosolopes · 2024-12-19T22:04:18Z

This is a bit more tricky than it sounds, it would be odd to have cir.long_long but not do the same for all other integer types (e.g. on some architectures int can be 2 bytes and would suffer from a similar problem).

I do agree FP is doing the more full and nicer path and it'd be nice to have it for integers (makes sense to better map the source info), but is the type indirection worth it? Seems like the only use so far is for handling TBAA - I'm curious if there are solutions where we can solve this without introducing new types?

(cc @dkolsen-pgi @sitio-couto)

dkolsen-pgi · 2024-12-19T22:35:37Z

Floating-point types in ClangIR do not have a 1-to-1 mapping to formats. Or they won't once extended floating-point support is added to Clang. double, _Float64, and _Float32x will all be distinct types in C++, but they will all map to the same !cir.double type. I am not thrilled with long double being a special kind of floating-point type that has an underlying type.

Doing something special for just the type long long is not the correct thing. If there is just one integral type that should be handled specially, it is long. long long is 64-bits on all supported platforms. char, short, and int are 8, 16, and 32 bits on almost all supported platforms. It is long that varies between 32 bits (Windows) and 64 bits (MacOS and Linux) on commonly used platforms.

If we need to preserve the original C++ type in ClangIR, for TBAA or some other purpose, then we should do it right and have separate ClangIR types for all arithmetic language types, each with a corresponding underlying format attribute. But that would be a big change and not something to be done lightly.

bcardosolopes · 2024-12-20T17:43:12Z

Agreed, same feeling here!

@PikachuHyA an alternative is to optionally attach the clangtype to CIRtypes, like we do for ast nodes on operations, TBAA could query that information to grasp higher level semantics.

PikachuHyA mentioned this issue Dec 19, 2024

[CIR][CIRGen][TBAA] Add support for scalar types #1220

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

handle long long type in TBAA #1241

handle long long type in TBAA #1241

PikachuHyA commented Dec 19, 2024

Lancern commented Dec 19, 2024

PikachuHyA commented Dec 19, 2024

bcardosolopes commented Dec 19, 2024 •

edited

Loading

dkolsen-pgi commented Dec 19, 2024

bcardosolopes commented Dec 20, 2024

handle long long type in TBAA #1241

handle long long type in TBAA #1241

Comments

PikachuHyA commented Dec 19, 2024

Lancern commented Dec 19, 2024

PikachuHyA commented Dec 19, 2024

bcardosolopes commented Dec 19, 2024 • edited Loading

dkolsen-pgi commented Dec 19, 2024

bcardosolopes commented Dec 20, 2024

bcardosolopes commented Dec 19, 2024 •

edited

Loading