Marathon shapes file format

Welcome. Here you will find a detailed description of Marathon shapes files: what they contain, how their content is packed and how to retrieve every piece of information. This document refers to files belonging to Marathon II: Durandal, Marathon Infinity and AlephOne, the open source evolution of the Marathon II engine. It doesn't currently cover the original Marathon shapes file format, which is similar but based on resource forks and is thus strictly bound to the MacOS architecture. All this information comes from my personal Marathon experience, a document found on the AlephOne engine web site, talks with the AlephOne crew, the original Marathon II sources and a bit of experimentation while writing the ShapeFusion editor. There may be errors and inaccuracies; reports are welcome!

Introduction to shapes files

Basic Marathon II scenarios are made of several files: a map file, a sounds file, an images file and a shapes file, which we are going to see. The shapes file contains

most of the game graphics, stored as 8-bit color bitmaps: wall textures, landscapes, monster sprites, weapon sprites, and the graphics that make up the user interface displayed while the game is running;
color tables for all those graphics;
animation data for weapons and sprites: timings, animation sequences, references to sounds, scale factors and so on.

The shapes file does not contain

level geometry, level parameters, item placement within levels. That info is stored in the map file;
terminal data. This is stored in the map file too;
physics data for monsters and players. This is stored again in the map file;
sounds. Those are packed in the sounds file;
chapter screens and main menu graphics; those are stored in the images file.

The shapes file is organized in collections. Each collection groups data belonging to a certain piece of the game; for example, there is a collection for each monster type, one for each texture set, one for each landscape and so on. Each collection is able to store two versions of the same data: one to be used when the game is run on 8-bit color displays, and one to be used when the game is run on modern true-color displays (this includes the AlephOne OpenGL mode). Note that the true-color version is rarely specified; it's mainly used where its effects are really noticeable: landscapes and weapons in hand. Also note that the true-color data does not contain true-color graphics! It's just 8-bit graphics like the other version, but with specialized color tables and usually improved pixel resolution.

I like to see collections as folders; each folder contains another folder for 8-bit data and, optionally, a folder for true-color data. Maybe this analogy is useful for you too.

Each collection version stores color tables, bitmaps, frames and sequences.

Color tables are simple arrays of RGB colors; all tables of a collection version store the same number of colors.
Bitmaps are the true graphics data; they store 8-bit images as indexes into color tables. They support a simple 1-bit transparency mechanism and can be compressed with a very simple RLE algorithm.
Frames are data structures that reference bitmaps and carry additional data useful for building animations. For example, they have flags for reflecting the associated bitmap. In engine terminology they are known as low level shapes.
Sequences organize frames in views, or orientations with respect to the observer. Each view may contain a single frame (for unanimated objects like lamps and puddles) or a sequence of frames (for animated objects like running monsters). A sequence is made of a variable number of views and each view references a variable number of frames. For example, the running Pfhor animation is described by a single sequence, containing a number of views (because the running Pfhor can be viewed from different angles). Each view contains a list of frame indexes. In engine terminology, sequences are called high level shapes.

Having different color tables is an easy way to change the color of all animation sprites on the fly.

Since Marathon was originally written as a Mac application, shapes files are stored in big-endian byte order. As explained in the following sections, data structures contain a lot of unused space that was originally used at runtime to store pointers and other info.

The collection headers

The shapes file begins with 32 collection headers. A collection header is 32 bytes long and has the following structure:

short	status
unsigned short	flags
long	offset8
long	length8
long	offset16
long	length16
unsigned char	unused[12]

status and flags seem to be unused and always set to 0. offset8 and length8 specify the position and size of the 8-bit collection version data block within the shapes file. Likewise, offset16 and length16 do the same for the true-color collection version. Offsets are relative to the beginning of the file, and may be set to -1 to indicate that the corresponding version is not present.
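The header layout above maps directly onto a fixed-size big-endian record. The following is a minimal sketch of reading it with Python's struct module; the names CollectionHeader and read_collection_headers are made up for this illustration and do not come from the engine or from ShapeFusion.

```python
import struct
from typing import List, NamedTuple

HEADER_COUNT = 32
# Big-endian: short status, unsigned short flags, four longs, 12 unused bytes.
HEADER_STRUCT = struct.Struct(">hHiiii12x")


class CollectionHeader(NamedTuple):
    status: int
    flags: int
    offset8: int
    length8: int
    offset16: int
    length16: int


def read_collection_headers(data: bytes) -> List[CollectionHeader]:
    """Parse the 32 collection headers found at the start of a shapes file."""
    if len(data) < HEADER_COUNT * HEADER_STRUCT.size:
        raise ValueError("file too short to hold 32 collection headers")
    return [
        CollectionHeader(*HEADER_STRUCT.unpack_from(data, i * HEADER_STRUCT.size))
        for i in range(HEADER_COUNT)
    ]
```

Pass the whole file content (read in binary mode) to read_collection_headers; a version is present only when its offset is not -1.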

The collection definition

If you follow one of the offsets in the collection header, you find the corresponding collection version data block. This begins with a 544-byte collection definition data structure:

short	version
short	type
unsigned short	flags
short	colors_per_table
short	color_table_count
long	color_tables_offset
short	high_level_shape_count
long	high_level_shape_table_offset
short	low_level_shape_count
long	low_level_shape_table_offset
short	bitmap_count
long	bitmap_table_offset
short	scale_factor
long	collection_size
unsigned char	unused[506]

version gives the file format version (3 for the format described in this document). type tells what kind of data is contained in this collection. flags should be unused and is usually 0, but I've found cases with 1 instead (Rubicon shapes). I don't yet know what this means. scale_factor is meant to be the "pixels to world" conversion factor for the whole collection, but seems ignored by the AlephOne engine and probably has no effect. collection_size tells the total size of this collection version; it must be equal to the corresponding length8 or length16 field in the collection header. Other fields should be self-explanatory; table offsets point to arrays of longs specifying offsets to bitmaps, frames and sequences. Important: all these offsets are relative to the collection definition, not to the beginning of the file.
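As a rough sketch of the above, the collection definition can be unpacked in one call, and the offset tables can then be resolved to absolute file positions by adding the position of the definition itself. The helper names and the base parameter (the offset8 or offset16 value taken from the collection header) are assumptions of this example.

```python
import struct
from typing import List, NamedTuple

# Big-endian layout of the 544-byte collection definition.
DEFINITION_STRUCT = struct.Struct(">hhHhhihihihihi506x")


class CollectionDefinition(NamedTuple):
    version: int
    type: int
    flags: int
    colors_per_table: int
    color_table_count: int
    color_tables_offset: int
    high_level_shape_count: int
    high_level_shape_table_offset: int
    low_level_shape_count: int
    low_level_shape_table_offset: int
    bitmap_count: int
    bitmap_table_offset: int
    scale_factor: int
    collection_size: int


def read_collection_definition(data: bytes, base: int) -> CollectionDefinition:
    """Parse the definition located at file position base."""
    return CollectionDefinition(*DEFINITION_STRUCT.unpack_from(data, base))


def read_offset_table(data: bytes, base: int, table_offset: int, count: int) -> List[int]:
    """Read count longs and turn them into absolute file positions.

    Both the table offset and the entries are relative to the definition.
    """
    relative = struct.unpack_from(">%di" % count, data, base + table_offset)
    return [base + r for r in relative]
```

For example, read_offset_table(data, base, d.bitmap_table_offset, d.bitmap_count) gives the file positions of all bitmap definitions of that collection version.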

Color tables

The color_tables_offset field of the collection definition points to an array of color_table_count color tables, which are just arrays of colors_per_table 8-byte long RGB color value structures:

unsigned char	flags
unsigned char	value
unsigned short	red
unsigned short	green
unsigned short	blue

In other words, color_tables_offset points to one big array of colors_per_table·color_table_count RGB color value structures. value is the color index used in bitmaps. red, green and blue are self-explanatory, but note that they are 16-bit components, while you usually deal with 8-bit RGB values. If you need 8-bit values, just shift each component right by 8 bits. Finally, the most significant bit of flags is the self luminescent color flag which, if set, alters the shading properties of that color.
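A minimal sketch of reading the color tables follows. It converts the 16-bit components to 8 bits by shifting, as described above. The function name, the tuple layout of the result and the base parameter (file position of the collection definition) are choices made for this example only.

```python
import struct

COLOR_STRUCT = struct.Struct(">BBHHH")  # flags, value, red, green, blue
SELF_LUMINESCENT = 0x80  # most significant bit of the 8-bit flags field


def read_color_tables(data, base, color_tables_offset,
                      color_table_count, colors_per_table):
    """Return a list of tables; each table is a list of
    (value, red8, green8, blue8, self_luminescent) tuples."""
    pos = base + color_tables_offset
    tables = []
    for _ in range(color_table_count):
        table = []
        for _ in range(colors_per_table):
            flags, value, r, g, b = COLOR_STRUCT.unpack_from(data, pos)
            # Keep only the high byte of each 16-bit component.
            table.append((value, r >> 8, g >> 8, b >> 8,
                          bool(flags & SELF_LUMINESCENT)))
            pos += COLOR_STRUCT.size
        tables.append(table)
    return tables
```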

Bitmaps

The bitmap_table_offset field of the collection definition points to an array of bitmap_count longs, which are offsets to as many bitmap definition objects. Each bitmap definition is 30 bytes long and has the following structure:

short	width
short	height
short	bytes_per_row
short	flags
short	bit_depth
unsigned char	unused[20]

width and height are the pixel dimensions of the bitmap. bytes_per_row is -1 for compressed bitmaps and a positive value for uncompressed ones. Compression doesn't shrink the pixel block as a whole; it just strips the transparent pixels at the start and end of each column, and so makes sense only for bitmaps with transparency. Bit 7 of flags is the column order flag: if enabled, pixels are stored in column-order rather than row-order (that is, first comes the first pixel column, then the second and so on). In that case bytes_per_row actually means bytes per column. Bit 6 of flags is the transparency enabled flag: if enabled, pixels set to color index 0 will be rendered as completely transparent. In shapes color tables, color 0 is traditionally set to bright blue (RGB 0, 0, 0xffff). Finally, bit_depth should always be set to 8.

After this preamble there is a certain amount of unused space to skip before coming to the actual pixel data. This space is 4·width bytes for column order bitmaps and 4·height bytes for row order bitmaps. It held pointers to scanlines in the original Mac engine, but has no meaning in files.

Then comes the actual pixel data. For uncompressed bitmaps it is just a block of width·height bytes, following the order specified by the column order flag. For compressed bitmaps one can't tell a priori how many bytes to read; the data must be decoded. Decoding works like this: for each pixel column read two shorts, first_row and last_row. Then read last_row-first_row bytes and copy them as pixel values to the destination bitmap, starting from pixel row first_row. Proceed in the same way for every column and the bitmap is decoded. Encoding works the opposite way: for each column of the source bitmap, find the first opaque pixel and write its row, find the last opaque pixel and write its row incremented by 1, then copy all pixels of the column spanning between those two rows. As you can see, it's a very poor algorithm, efficient only for a limited set of bitmaps.
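The decoding and encoding steps above can be sketched as follows. This is only an illustration: the function names are invented, pixels are kept as one bytearray per column, pixels outside the stored run are left at index 0, and the choice of writing first_row = last_row = 0 for a completely transparent column is an assumption, since the text does not cover that case.

```python
import struct
from typing import List, Tuple


def decode_compressed(data: bytes, pos: int, width: int,
                      height: int) -> Tuple[List[bytearray], int]:
    """Decode a compressed bitmap whose pixel data starts at pos.

    Returns (columns, end_pos), where columns[x][y] is the color index
    of pixel (x, y) and end_pos is the position just past the data.
    """
    columns = []
    for _ in range(width):
        first_row, last_row = struct.unpack_from(">hh", data, pos)
        pos += 4
        if not 0 <= first_row <= last_row <= height:
            raise ValueError("run lies outside the bitmap")
        run = last_row - first_row
        if pos + run > len(data):
            raise ValueError("pixel data is truncated")
        column = bytearray(height)  # filled with index 0 (transparent)
        column[first_row:last_row] = data[pos:pos + run]
        pos += run
        columns.append(column)
    return columns, pos


def encode_compressed(columns: List[bytearray]) -> bytes:
    """Encode columns of color indexes, treating index 0 as transparent."""
    out = bytearray()
    for column in columns:
        opaque = [y for y, pixel in enumerate(column) if pixel != 0]
        # Assumption: an all-transparent column is written as an empty run.
        first_row = opaque[0] if opaque else 0
        last_row = opaque[-1] + 1 if opaque else 0
        out += struct.pack(">hh", first_row, last_row)
        out += bytes(column[first_row:last_row])
    return bytes(out)
```

Note that transparent pixels lying between the first and last opaque pixel of a column are stored verbatim, which is why the scheme only pays off for sprites surrounded by empty space.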

Frames

The low_level_shape_table_offset field of the collection definition points to an array of low_level_shape_count longs, which are offsets to as many low level shape definition objects. Each of these is 36 bytes long and has the following structure:

unsigned short	flags
long	minimum_light_intensity
short	bitmap_index
short	origin_x
short	origin_y
short	key_x
short	key_y
short	world_left
short	world_right
short	world_top
short	world_bottom
short	world_x0
short	world_y0
unsigned char	unused[8]

Bit 7 of flags is the X mirror flag, bit 6 the Y mirror flag and bit 5 the keypoint obscured flag. X and Y mirror flags, if set, cause the associated frame bitmap to be rendered flipped along the vertical and horizontal axes respectively. The keypoint flag makes sense only for the player collection, and controls whether the torso is drawn over the legs or the opposite.

minimum_light_intensity is a fixed point value ranging between 0 and 1 (0x10000) and specifies the minimum light intensity to use when rendering the bitmap. This is useful for creating self-luminescent sprites like flames and the hunter bolt.

bitmap_index is the associated bitmap index. -1 means the frame has no associated bitmap and thus is not valid.

origin_x and origin_y tell where the "logical" bitmap origin lies within the physical bitmap dimensions. Note that these fields are not used by the engine; they look like a temporary place for editors to calculate world_* fields.

key_x and key_y specify the position of the keypoint within the physical bitmap dimensions. The keypoint is used only for player collections and tells where to attach torso frames to leg frames: the torso origin is placed at the legs keypoint position. As with the origin fields, these are not used by the engine; they look like a temporary place for editors to calculate world_* fields.

world_* fields encode the same info carried by the previous four fields. They represent the scaled bitmap rectangle and keypoint position in world coordinates. They are pre-computed by editors as follows:

world_left = -scale_factor ⋅ origin_x
world_top = scale_factor ⋅ origin_y
world_right = scale_factor ⋅ (width - origin_x)
world_bottom = -scale_factor ⋅ (height - origin_y)
world_x0 = scale_factor ⋅ (key_x - origin_x)
world_y0 = -scale_factor ⋅ (key_y - origin_y)

where width and height are the associated bitmap dimensions and scale_factor comes from the sequence using this frame. When that value is set to 0, the scale_factor in the collection definition is used instead.
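To make the frame layout and the formulas above concrete, here is a small sketch that parses one low level shape definition and recomputes the world_* fields the way an editor would. The names Frame, WorldRect, read_frame and world_fields are invented for this example, and the conversion of minimum_light_intensity to a float simply divides by 0x10000, the stored value for 1.

```python
import struct
from typing import NamedTuple

# Big-endian: flags, minimum_light_intensity, 11 shorts, 8 unused bytes.
FRAME_STRUCT = struct.Struct(">Hi11h8x")
FIXED_ONE = 0x10000  # fixed point representation of 1


class Frame(NamedTuple):
    flags: int
    minimum_light_intensity: float
    bitmap_index: int
    origin_x: int
    origin_y: int
    key_x: int
    key_y: int
    world_left: int
    world_right: int
    world_top: int
    world_bottom: int
    world_x0: int
    world_y0: int


class WorldRect(NamedTuple):
    left: int
    right: int
    top: int
    bottom: int
    x0: int
    y0: int


def read_frame(data: bytes, pos: int) -> Frame:
    """Parse the 36-byte low level shape definition at pos."""
    flags, light, *rest = FRAME_STRUCT.unpack_from(data, pos)
    return Frame(flags, light / FIXED_ONE, *rest)


def world_fields(scale_factor, width, height,
                 origin_x, origin_y, key_x, key_y) -> WorldRect:
    """Apply the editor formulas; width and height come from the bitmap."""
    return WorldRect(
        left=-scale_factor * origin_x,
        right=scale_factor * (width - origin_x),
        top=scale_factor * origin_y,
        bottom=-scale_factor * (height - origin_y),
        x0=scale_factor * (key_x - origin_x),
        y0=-scale_factor * (key_y - origin_y),
    )
```

Comparing the output of world_fields with the stored world_* values is a quick way to check which scale factor an editor used for a given frame.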

Sequences

The high_level_shape_table_offset field of the collection definition points to an array of high_level_shape_count longs, which are offsets to as many high level shape definition objects. Each of these is 88 bytes long and follows this structure:

short	type
unsigned short	flags
char	name[34]
short	number_of_views
short	frames_per_view
short	ticks_per_frame
short	key_frame
short	transfer_mode
short	transfer_mode_period
short	first_frame_sound
short	key_frame_sound
short	last_frame_sound
short	scale_factor
short	loop_frame
unsigned char	unused[28]

type and flags should always be 0, but Rubicon has sequences with flags set to 1. I don't know what that means. name is a Pascal string that can be used to give a meaningful name to the sequence. Its first byte is the string length, followed by the string characters. number_of_views determines the number of angles the sequence can be viewed from, but it does not store that number directly (the field name is quite misleading). Instead, the actual number must be looked up in the following table:

`number_of_views`	Animation type	Actual number of views
10	unanimated	1
1	animated1	1
3	animated3to4	4 (0°, ±90°, 180°)
4	animated4	4 (0°, ±90°, 180°)
9	animated3to5	5
11	animated5	5
2	animated2to8	8 (0°, ±45°, ±90°, ±135°, 180°)
5	animated5to8	8 (0°, ±45°, ±90°, ±135°, 180°)
8	animated8	8 (0°, ±45°, ±90°, ±135°, 180°)

frames_per_view tells how many frames are used in each view. ticks_per_frame specifies the duration of a single frame in units of ticks. first_frame_sound, key_frame_sound and last_frame_sound tell which sound to play at the first, key and last frames respectively (sounds are taken from the associated scenario sounds file). scale_factor is meant to be the pixels-to-world scale factor for the sequence, but seems ignored by the AlephOne engine, and should have no effect. Referenced frames already carry all the necessary info about unit conversions.

After each high level shape definition block there is an array of signed short frame indexes, defining animation frames. You get the number of indexes as (number of views) times frames_per_view. Again, the number of views is not simply number_of_views but must be looked up in the table given before.
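Putting the sequence layout, the view table and the trailing index array together, a reader could look roughly like the sketch below. Names such as Sequence, VIEW_COUNT and read_sequence are invented here. Decoding the Pascal string as MacRoman is an assumption based on the Mac origin of the files, and the frame indexes are returned as one flat list because the text above does not state how they are ordered among views.

```python
import struct
from typing import List, NamedTuple

# Big-endian: type, flags, 34-byte name, 11 shorts, 28 unused bytes.
SEQUENCE_STRUCT = struct.Struct(">hH34s11h28x")

# number_of_views field value -> actual number of views (table above).
VIEW_COUNT = {10: 1, 1: 1, 3: 4, 4: 4, 9: 5, 11: 5, 2: 8, 5: 8, 8: 8}


class Sequence(NamedTuple):
    name: str
    number_of_views: int
    view_count: int
    frames_per_view: int
    ticks_per_frame: int
    frame_indexes: List[int]


def read_sequence(data: bytes, pos: int) -> Sequence:
    """Parse the 88-byte high level shape definition at pos and the
    array of frame indexes that follows it."""
    fields = SEQUENCE_STRUCT.unpack_from(data, pos)
    raw_name = fields[2]
    length = min(raw_name[0], len(raw_name) - 1)  # first byte is the length
    name = raw_name[1:1 + length].decode("mac_roman")  # encoding assumed
    number_of_views, frames_per_view, ticks_per_frame = fields[3:6]
    view_count = VIEW_COUNT[number_of_views]
    total = view_count * frames_per_view
    indexes = struct.unpack_from(">%dh" % total, data,
                                 pos + SEQUENCE_STRUCT.size)
    return Sequence(name, number_of_views, view_count,
                    frames_per_view, ticks_per_frame, list(indexes))
```

A number_of_views value missing from the table raises KeyError here, which is a deliberate choice for the sketch: it is safer to stop than to guess how many indexes follow.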

Last update: 2007-02-02

Tito Dal Canton

Physics is reverse engineering