You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

234 lines
5.1 KiB

Structured Headers parser for Javascript
========================================
This library is a parser and serializer for the [Structured Headers][1]
specification, a.k.a. "Structured Field Values for HTTP" (RFC8941).
This specification defines a standard serialization for complex HTTP header
values, including lists (arrays), dictionaries (maps) and also numbers,
booleans, and binary data.
The library is written in Typescript, and the examples in this document are
too, but plain Javascript is also fully supported.
Compatibility
-------------
This package has 2725 unittests, the vast majority are supplied from the
official [HTTP WG test suite][2].
However, there are 2 differences in the serializer:
1. Javascript can't differentiate between `1.0` and `1`. As a result we're
skipping the tests that require a serialiation output of `1.0`.
2. Javascript rounds slightly different from the spec. The tests suggest that
`0.0025` should round to the nearest event number (`0.002`), but Javascript
rounds to `0.003`.
No fix is planned for #1, because there's no reasonably way to fix this
without wrapping every number in a custom class, and this will negatively
impact the developer experience. We do intend to fix #2 in the future with a
custom rounding algorithm.
This library emits and expects the _exact_ data structures as they are
suggested by the RFC. The result of this is that the returned types can be
a bit complex.
In the future we intend to loosen the required types for the serializer, and
add new helper functions that give you simpler structures _if_ you don't need
certain features for a header (such as `Parameters`).
Let us know what you would like to see here!
Installation
------------
Using npm:
```
npm install structured-headers
```
API
---
### Parsing an item
The following are examples of `item` headers:
Parsed as string
```
# Parsed as string
Header: "foo"
# A simple string, called a 'Token' in the spec
Header: foo
# Parsed as number
Header: 5
Header: -10
Header: 5.01415
# Parsed into boolean
Header: ?1
Header: ?0
# Binaries are base64 encoded
Header: :RE0gbWUgZm9yIGEgZnJlZSBjb29raWU=:
# Items can have parameters
Header: "Hello world"; a="5"
```
To parse these header values, use the `parseItem`:
```typescript
import { parseItem } from 'structured-headers';
console.log(
parseItem(header)
);
```
parseItem returns a tuple (array with 2 items), the first item is the value,
the second is a `Map` object with parameters.
The type is roughly:
```typescript
// The raw value
type BareItem = number | string | Token | ByteSequence | boolean;
// The return type of parseItem
type Item = [
BareItem,
Map<string, BareItem>
];
```
### Parsing a list
A list is an array of items. Some examples:
```
# A simple list
Header: 5, "foo", bar, ?1
# Each element can have parameters
Header: sometoken; param1; param2=hi, 42
# A list can also contain lists itself. These are called 'inner lists' and
# use parenthesis
Header: sometoken, (innerlistitem1 innerlistitem2), (anotherlist)
```
To parse these:
```typescript
import { parseList } from 'structured-headers';
console.log(
parseList(header)
);
```
`parseList` returns an array with each member. The return type is:
```typescript
type InnerList = [Item[], Parameters];
type List = (InnerList|Item)[];
```
### Parsing a dictionary
A dictionary is a key->value object. Examples:
```
# A simple dictionary
Header: fn="evert", ln="pot", coffee=?1
# Each item may have parameters too
Header: foo=123; q=1, bar=123, q=0.5
# A dictionary value may be an inner list again
Header: foo=(1 2 3)
```
To parse dictionaries:
```typescript
import { parseDictionary } from 'structured-headers';
console.log(
parseDictionary(header)
);
```
The return type for `parseDictionary` is a `Map`.
```typescript
type Dictionary = Map<string, Item|InnerList>;
```
### Serializing
The serialiser functions work the exact same way, but in opposite direction.
They all return strings.
Currently the serializes expect the *exact type* that the parsers return, but
the intention is to loosen the types for serialization, so it's a bit more
ergnomic to call. Want this? Let me know by opening an issue.
```javascript
import {
serializeDictionary,
serializeList,
serializeItem
} from 'structured-headers';
// Returns "foo", "bar"
serializeList([
['foo', new Map()],
['bar', new Map()],
]);
// Returns a=1, b=?0
sh.serializeDictionary(new Map([
['a', [1, new Map()]],
['b', [false, new Map()]],
]));
// Returns 42
serializeItem([42, new Map()]);
// Returns 5.5
serializeItem([5.5, new Map()]);
// Returns "hello world"
serializeItem(["hello world", new Map()]);
// Returns ?1
serializeItem([true, new Map()]);
// Returns a base-64 representation like: *aGVsbG8=*
serializeItem([new ByteSequence('aGVsbG8='), new Map()]);
```
Browser support
---------------
There is a minified version of this library in the `browser/` directory. This minified
file will expose a global variable called 'structuredHeader' which contains the rest
of the api.
[1]: https://datatracker.ietf.org/doc/html/rfc8941
[2]: https://github.com/httpwg/structured-field-tests