Emacs reference manual read

Notes from reading Emacs Lisp Reference Manual.

1. Intro

1.1. Why the heck did I start reading this?

Well because RTFM is one of the best ways (if not the way) to actually really grasp any languages and I've been putting it off for too long since I started using Emacs. Secondly, I'm maintaining my own Emacs config and it's becoming really hard to maintain because of debugging issues I had. Finally, I was again reminded by this talk that one needs to go through the manual to get good at elisp programming.

So I started reading the manual and documenting notes here. I don't intend to run every example code snippet but pick some functions that I think are important in general and to myself. However I do try to copy and paste all functions mentioned in the manual so that I can grep this file for things I need instead of Googling randomly.

2. Lisp definitions and types

A Lisp expression that you can evaluate is called a “form”.
A “symbol” in GNU Emacs Lisp is an object with a name.
A symbol whose name starts with a colon (‘:’) is called a “keyword symbol”.
A “sequence” is a Lisp object that represents an ordered set of elements. There are two kinds of sequence in Emacs Lisp: “lists” and “arrays”.
A “cons cell” is an object that consists of two slots, called the CAR slot and the CDR slot.
- Each slot can “hold” any Lisp object.
- We also say that the CAR of this cons cell is whatever object its CAR slot currently holds, and likewise for the CDR.
- CDR is pronounced “could-er”.
An “association list” or “alist” is a specially-constructed list whose elements are cons cells.
An “array” is composed of an arbitrary number of slots for holding or referring to other Lisp objects, arranged in a contiguous block of memory. (approx. O(1) access time).
- Emacs defines four types of array: strings, vectors, bool-vectors, and char-tables.
- A “vector” is a one-dimensional array of elements of any type. It takes a constant amount of time to access any element of a vector.
- A “char-table” is a one-dimensional array of elements of any type, indexed by character codes.
- A “bool-vector” is a one-dimensional array whose elements must be ‘t’ or ‘nil’.
  - The printed representation of a bool-vector is like a string, except that it begins with ‘#&’ followed by the length.
- A hash table is a very fast kind of lookup table, somewhat like an alist in that it maps keys to corresponding values, but much faster.
A “Lisp macro” is a user-defined construct that extends the Lisp language.
A “primitive function” is a function callable from Lisp but written in the C programming language.
- Primitive functions are also called “subrs” or “built-in functions”.
- Most primitive functions evaluate all their arguments when they are called. A primitive function that does not evaluate all its arguments is called a “special form”.
A “record” is much like a ‘vector’. However, the first element is used to hold its type as returned by ‘type-of’. The purpose of records is to allow programmers to create objects with new types that are not built into Emacs.
An “autoload object” is a list whose first element is the symbol ‘autoload’.
A “finalizer object” helps Lisp code clean up after objects that are no longer needed.

The names CAR and CDR derive from the history of Lisp. The original
Lisp implementation ran on an IBM 704 computer which divided words into
two parts, the address and the decrement; CAR was an instruction to
extract the contents of the address part of a register, and CDR an
instruction to extract the contents of the decrement. By contrast, cons
cells are named for the function ‘cons’ that creates them, which in turn
was named for its purpose, the construction of cells.

(rose violet buttercup)
      ----------–— -----------–— --------------–—
     | car | cdr | | car | cdr | | car | cdr |
     | rose | o---–—>| violet | o---–—>| buttercup | nil |
     | | | | | | | | |
      ----------–— -----------–— --------------–—

((pine needles) oak maple)

      ---------–— ---------–— ---------–—
     | car | cdr | | car | cdr | | car | cdr |
     | o | o--–—>| oak | o--–—>| maple | nil |
     | | | | | | | | | |
      – | ----–— ---------–— ---------–—
         |
         |
         | ---------–— -----------–—
         | | car | cdr | | car | cdr |
          -–—>| pine | o--–—>| needles | nil |
                 | | | | | |
                  ---------–— -----------–—

alist

((rose . red) (lily . white) (buttercup . yellow))
      ----------–— -----------–— --------------–—
     | car | cdr | | car | cdr | | car | cdr |
     | rose | o---–—>| lily | o---–—>| buttercup | nil |
     | red | | | white | | | yellow | |
      ----------–— -----------–— --------------–—

[1 "two" (three)]      ; A vector of three elements.

3. Editing types

A “marker” denotes a position in a specific buffer.
A “window” describes the portion of the terminal screen that Emacs uses to display a buffer. Every window has one associated buffer, whose contents appear in the window. By contrast, a given buffer may appear in one window, no window, or several windows.
A “frame” is a screen area that contains one or more Emacs windows. We also use the term “frame” to refer to the Lisp object that Emacs uses to refer to the screen area.
A process is a Lisp object that designates a subprocess created by the Emacs process.
- An Emacs subprocess takes textual input from Emacs and returns textual output to Emacs for further manipulation. Emacs can also send signals to the subprocess.
- (process-list)
A “thread” in Emacs represents a separate thread of Emacs Lisp execution. It runs its own Lisp program, has its own current buffer, and can have subprocesses locked to it, i.e. subprocesses whose output only this thread can accept.
- (all-threads)
A “mutex” is an exclusive lock that threads can own and disown, in order to synchronize between them.
A “condition variable” is a device for a more complex thread synchronization than the one supported by a mutex. A thread can wait on a condition variable, to be woken up when some other thread notifies the condition.
A “stream” is an object that can be used as a source or sink for characters—either to supply characters for input or to accept them as output.
A “keymap” maps keys typed by the user to commands. This mapping controls how the user’s command input is executed. A keymap is actually a list whose CAR is the symbol ‘keymap’.
An “overlay” specifies properties that apply to a part of a buffer. Each overlay applies to a specified range of the buffer, and contains a property list (a list whose elements are alternating property names and values). Overlay properties are used to present parts of the buffer temporarily in a different display style. Overlays have no read syntax, and print in hash notation, giving the buffer name and range of positions.
A “font” specifies how to display text on a graphical terminal.
- font objects
- font specs
- font entities

4. Type checking (predicates)

(setq list nil)
(defun add-on (x)
  (cond ((symbolp x)
         ;; If X is a symbol, put it on LIST.
         (setq list (cons x list))) 
        ((listp x)
         ;; If X is a list, add its elements to LIST.
         (setq list (append x list))) 
        (t
         ;; We handle only symbols and lists.
         (error 
          "Invalid argument %s in add-on"
          x))))
(add-on 'a)

4.1. list of type predicates

Put this here as a quick reference for grepping.

‘atom’
     *Note atom: List-related Predicates.

‘arrayp’
     *Note arrayp: Array Functions.

‘bignump’
     *Note floatp: Predicates on Numbers.

‘bool-vector-p’
     *Note bool-vector-p: Bool-Vectors.

‘booleanp’
     *Note booleanp: nil and t.

‘bufferp’
     *Note bufferp: Buffer Basics.

‘byte-code-function-p’
     *Note byte-code-function-p: Byte-Code Type.

‘case-table-p’
     *Note case-table-p: Case Tables.

‘char-or-string-p’
     *Note char-or-string-p: Predicates for Strings.

‘char-table-p’
     *Note char-table-p: Char-Tables.

‘commandp’
     *Note commandp: Interactive Call.

‘condition-variable-p’
     *Note condition-variable-p: Condition Variables.

‘consp’
     *Note consp: List-related Predicates.

‘custom-variable-p’
     *Note custom-variable-p: Variable Definitions.

‘fixnump’
     *Note floatp: Predicates on Numbers.

‘floatp’
     *Note floatp: Predicates on Numbers.

‘fontp’
     *Note Low-Level Font::.

‘frame-configuration-p’
     *Note frame-configuration-p: Frame Configurations.

‘frame-live-p’
     *Note frame-live-p: Deleting Frames.

‘framep’
     *Note framep: Frames.

‘functionp’
     *Note functionp: Functions.

‘hash-table-p’
     *Note hash-table-p: Other Hash.

‘integer-or-marker-p’
     *Note integer-or-marker-p: Predicates on Markers.

‘integerp’
     *Note integerp: Predicates on Numbers.

‘keymapp’
     *Note keymapp: Creating Keymaps.

‘keywordp’
     *Note Constant Variables::.

‘listp’
     *Note listp: List-related Predicates.

‘markerp’
     *Note markerp: Predicates on Markers.

‘mutexp’
     *Note mutexp: Mutexes.

‘nlistp’
     *Note nlistp: List-related Predicates.

‘number-or-marker-p’
     *Note number-or-marker-p: Predicates on Markers.

‘numberp’
     *Note numberp: Predicates on Numbers.

‘overlayp’
     *Note overlayp: Overlays.

‘processp’
     *Note processp: Processes.

‘recordp’
     *Note recordp: Record Type.

‘sequencep’
     *Note sequencep: Sequence Functions.

‘string-or-null-p’
     *Note string-or-null-p: Predicates for Strings.

‘stringp’
     *Note stringp: Predicates for Strings.

‘subrp’
     *Note subrp: Function Cells.

‘symbolp’
     *Note symbolp: Symbols.

‘syntax-table-p’
     *Note syntax-table-p: Syntax Tables.

‘threadp’
     *Note threadp: Basic Thread Functions.

‘vectorp’
     *Note vectorp: Vectors.

‘wholenump’
     *Note wholenump: Predicates on Numbers.

‘window-configuration-p’
     *Note window-configuration-p: Window Configurations.

‘window-live-p’
     *Note window-live-p: Deleting Windows.

‘windowp’
     *Note windowp: Basic Windows.

5. Equality predicates

5.1. `eq` (deep compare)

(eq 'foo 'foo) ;; t
(eq ?A ?A) ;; t
(eq 3.0 3.0) ;; t or nil Equal floats may or may not be the same object.
(eq (make-string 3 ?A) (make-string 3 ?A)) ;; nil because these are two different objects
(eq "asdf" "asdf") ;; t or nil Equal string constants or may not be the same object.

(eq '(1 (2 (3))) '(1 (2 (3)))) ;; nil

(setq foo '(1 (2 (3))))
(eq foo foo) ;; t
(eq foo '(1 (2 (3)))) ;; nil
(eq (point-marker) (point-marker)) ;; nil

(eq (make-symbol "foo") 'foo) ;; nil

Odd one with make-symbol and intern (see also: elisp make-symbol vs intern).

(eq (make-symbol "foo") 'foo) ;; nil
(eq (intern "foo") 'foo) ;; t

5.2. `equal` (shallow compare)

(equal 'foo 'foo) ;; t
(equal "asdf" "asdf") ;; t
(equal (point-marker) (point-marker)) ;; t
(equal (make-symbol "foo") 'foo) ;; nil
(equal (intern "foo") 'foo) ;; t
(equal "asdf" "ASDF") ;; nil. equal is case-sensitive

5.3. `equal-including-properties` to test

This function behaves like ‘equal’ in all cases but also requires
that for two strings to be equal, they have the same text
properties.

(equal "asdf" (propertize "asdf" 'asdf t))

(equal-including-properties "asdf"
                            (propertize "asdf" 'asdf t))

6. Mutability

(aset)

A mutable object stops being mutable if it is part of an expression that is evaluated. For example:

(let* ((x (list 0.5))
           (y (eval (list 'quote x))))
  (setcar x 1.5) ;; the program should not do this.
  (car x)
  )

If a program attempts to change objects that should not be changed, the resulting behavior is undefined: the Lisp interpreter might signal an error, or it might crash or behave unpredictably in other ways.

7. Numbers

7.1. Integer

Integer representations:

1  ;; The integer 1.
1. ;; The integer 1.
+1 ;; Also the integer 1.
-1 ;; The integer −1.
0  ;; The integer 0.
-0 ;; The integer 0.

The syntax for integers in bases other than 10 consists of ‘#’ followed by a radix indication followed by one or more digits. The radix indications are ‘b’ for binary, ‘o’ for octal, ‘x’ for hex, and ‘RADIXr’ for radix RADIX. Thus, ‘#bINTEGER’ reads INTEGER in binary, and ‘#RADIXrINTEGER’ reads INTEGER in radix RADIX. Allowed values of RADIX run from 2 to 36, and allowed digits are the first RADIX characters taken from ‘0’–‘9’, ‘A’–‘Z’. Letter case is ignored and there is no initial sign or final period. For example:

(message (format "%s" #b101100))        ⇒ 44
#o54            ⇒ 44
#x2c            ⇒ 44
#24r1k  ⇒ 44

Some useful variables.

– Variable: most-positive-fixnum
    The value of this variable is the greatest “small” integer that
    Emacs Lisp can handle. Typical values are 2**29 − 1 on 32-bit and
    2**61 − 1 on 64-bit platforms.

– Variable: most-negative-fixnum
    The value of this variable is the numerically least “small” integer
    that Emacs Lisp can handle. It is negative. Typical values are
    −2**29 on 32-bit and −2**61 on 64-bit platforms.

– Variable: integer-width
    The value of this variable is a nonnegative integer that controls
    whether Emacs signals a range error when a large integer would be
    calculated. Integers with absolute values less than 2**N, where N
    is this variable’s value, do not signal a range error. Attempts to
    create larger integers typically signal a range error, although
    there might be no signal if a larger integer can be created
    cheaply. Setting this variable to a large number can be costly if
    a computation creates huge integers.

7.2. Floating-Point number

(= 0.0 0) ;; t
(= -0.0 0) ;; t
(/ 0.0 0.0) ;; -0.0e+NaN
(setq x -0.0e+NaN)
(setq y -0.0e+NaN)
(= x y) ;; nil
(equal x y) ;; t
(eq x y) ;; nil

Useful functions

– Function: isnan x
    This predicate returns ‘t’ if its floating-point argument is a NaN,
    ‘nil’ otherwise.

– Function: frexp x
    This function returns a cons cell ‘(S . E)’, where S and E are
    respectively the significand and exponent of the floating-point
    number X.

    If X is finite, then S is a floating-point number between 0.5
    (inclusive) and 1.0 (exclusive), E is an integer, and X = S * 2**E.
    If X is zero or infinity, then S is the same as X. If X is a NaN,
    then S is also a NaN. If X is zero, then E is 0.

– Function: ldexp s e
    Given a numeric significand S and an integer exponent E, this
    function returns the floating point number S * 2**E.

– Function: copysign x1 x2
    This function copies the sign of X2 to the value of X1, and returns
    the result. X1 and X2 must be floating point.

– Function: logb x
    This function returns the binary exponent of X. More precisely, if
    X is finite and nonzero, the value is the logarithm base 2 of |x|,
    rounded down to an integer. If X is zero or infinite, the value is
    infinity; if X is a NaN, the value is a NaN.

  (logb 10)
       ⇒ 3
  (logb 10.0e20)
       ⇒ 69
  (logb 0)
       ⇒ -1.0e+INF

7.3. Type Predicates for Numbers

– Function: bignump object
    This predicate tests whether its argument is a large integer, and
    returns ‘t’ if so, ‘nil’ otherwise. Unlike small integers, large
    integers can be ‘=’ or ‘eql’ even if they are not ‘eq’.

– Function: fixnump object
    This predicate tests whether its argument is a small integer, and
    returns ‘t’ if so, ‘nil’ otherwise. Small integers can be compared
    with ‘eq’.

– Function: floatp object
    This predicate tests whether its argument is floating point and
    returns ‘t’ if so, ‘nil’ otherwise.

– Function: integerp object
    This predicate tests whether its argument is an integer, and
    returns ‘t’ if so, ‘nil’ otherwise.

– Function: numberp object
    This predicate tests whether its argument is a number (either
    integer or floating point), and returns ‘t’ if so, ‘nil’ otherwise.

– Function: natnump object
    This predicate (whose name comes from the phrase “natural number”)
    tests to see whether its argument is a nonnegative integer, and
    returns ‘t’ if so, ‘nil’ otherwise. 0 is considered non-negative.

    ‘wholenump’ is a synonym for ‘natnump’.

– Function: zerop number
    This predicate tests whether its argument is zero, and returns ‘t’
    if so, ‘nil’ otherwise. The argument must be a number.

    ‘(zerop x)’ is equivalent to ‘(= x 0)’.

7.4. Comparison of Numbers

Because floating-point arithmetic is not exact, it is often a bad idea to check for equality of floating-point values. Usually it is better to test for approximate equality. Here’s a function to do this:

(defvar fuzz-factor 1.0e-6)
(defun approx-equal (x y)
  (or (= x y) 
          (< (/ (abs (- x y)) 
                        (max (abs x) 
                                 (abs y))) fuzz-factor)))

Useful functions.

– Function: = number-or-marker &rest number-or-markers
    This function tests whether all its arguments are numerically
    equal, and returns ‘t’ if so, ‘nil’ otherwise.

– Function: eql value1 value2
    This function acts like ‘eq’ except when both arguments are
    numbers. It compares numbers by type and numeric value, so that
    ‘(eql 1.0 1)’ returns ‘nil’, but ‘(eql 1.0 1.0)’ and ‘(eql 1 1)’
    both return ‘t’. This can be used to compare large integers as
    well as small ones.

– Function: /= number-or-marker1 number-or-marker2
    This function tests whether its arguments are numerically equal,
    and returns ‘t’ if they are not, and ‘nil’ if they are.

– Function: < number-or-marker &rest number-or-markers
    This function tests whether each argument is strictly less than the
    following argument. It returns ‘t’ if so, ‘nil’ otherwise.

– Function: <= number-or-marker &rest number-or-markers
    This function tests whether each argument is less than or equal to
    the following argument. It returns ‘t’ if so, ‘nil’ otherwise.

– Function: > number-or-marker &rest number-or-markers
    This function tests whether each argument is strictly greater than
    the following argument. It returns ‘t’ if so, ‘nil’ otherwise.

– Function: >= number-or-marker &rest number-or-markers
    This function tests whether each argument is greater than or equal
    to the following argument. It returns ‘t’ if so, ‘nil’ otherwise.

– Function: max number-or-marker &rest numbers-or-markers
    This function returns the largest of its arguments.

  (max 20)
       ⇒ 20
  (max 1 2.5)
       ⇒ 2.5
  (max 1 3 2.5)
       ⇒ 3

– Function: min number-or-marker &rest numbers-or-markers
    This function returns the smallest of its arguments.

  (min -4 1)
       ⇒ -4

– Function: abs number
    This function returns the absolute value of NUMBER.

7.5. Numeric Conversions

To convert an integer to floating point, use the function ‘float’.

;; This returns NUMBER converted to floating point.  If NUMBER is
;; already floating point, ‘float’ returns it unchanged.
(float 1) ;; 1.0

There are four functions to convert floating-point numbers to integers; they differ in how they round. All accept an argument NUMBER and an optional argument DIVISOR. Both arguments may be integers or floating-point numbers. DIVISOR may also be ‘nil’. If DIVISOR is ‘nil’ or omitted, these functions convert NUMBER to an integer, or return it unchanged if it already is an integer. If DIVISOR is non-‘nil’, they divide NUMBER by DIVISOR and convert the result to an integer. If DIVISOR is zero (whether integer or floating point), Emacs signals an ‘arith-error’ error.

(truncate 1.2) ;; 1
(truncate 7.2 3) ;; 2
(truncate -1.2) ;; -1
(floor 1.2) ;; 1
(floor -1.2) ;; -2
(ceiling 1.2) ;; 2
(ceiling -1.2) ;; -1
(round 1.2) ;; 1
(round -1.2) ;; -1
(round 7 3) ;; 2

7.6. Arithmetic Operations

Plus 1.

(1+ 1) ;; 2

Minus 1.

(1- 1) ;; 0

(+) ;; 0
(+ 1) ;; 1
(-) ;; 0
(- 10) ;; -10
(- 10 1) ;; 9
(*) ;; 1
(* 10) ;; 10
(* 10 2 2) ;; 40
(* 10 2 2.0) ;; 40.0
(/ 6 2) ;; 3
(/ 6 2.0) ;; 3.0
(/ 4) ;; 0

;; remainder
(% 10 2) ;; 0
(% 11 2) ;; 1
(% 11.0 2) ;; wrong-type-argument number-or-marker-p 11.0

(mod 9 4) ;; 1
(mod 9.0 4) ;; 1.0

7.7. (float) Rounding Operations

The functions ‘ffloor’, ‘fceiling’, ‘fround’, and ‘ftruncate’ take a
floating-point argument and return a floating-point result whose value
is a nearby integer. ‘ffloor’ returns the nearest integer below;
‘fceiling’, the nearest integer above; ‘ftruncate’, the nearest integer
in the direction towards zero; ‘fround’, the nearest integer.

– Function: ffloor float
     This function rounds FLOAT to the next lower integral value, and
     returns that value as a floating-point number.

– Function: fceiling float
     This function rounds FLOAT to the next higher integral value, and
     returns that value as a floating-point number.

– Function: ftruncate float
     This function rounds FLOAT towards zero to an integral value, and
     returns that value as a floating-point number.

– Function: fround float
     This function rounds FLOAT to the nearest integral value, and
     returns that value as a floating-point number. Rounding a value
     equidistant between two integers returns the even integer.

7.8. Bitwise Operations on Integers

Won't touch this here at the moment.

3.8 Bitwise Operations on Integers

7.9. Standard Math Functions

– Function: sin arg
– Function: cos arg
– Function: tan arg
     These are the basic trigonometric functions, with argument ARG
     measured in radians.

– Function: asin arg
     The value of ‘(asin ARG)’ is a number between −pi/2 and pi/2
     (inclusive) whose sine is ARG. If ARG is out of range (outside
     [−1, 1]), ‘asin’ returns a NaN.

– Function: acos arg
     The value of ‘(acos ARG)’ is a number between 0 and pi (inclusive)
     whose cosine is ARG. If ARG is out of range (outside [−1, 1]),
     ‘acos’ returns a NaN.

– Function: atan y &optional x
     The value of ‘(atan Y)’ is a number between −pi/2 and pi/2
     (exclusive) whose tangent is Y. If the optional second argument X
     is given, the value of ‘(atan y x)’ is the angle in radians between
     the vector ‘[X, Y]’ and the ‘X’ axis.

– Function: exp arg
     This is the exponential function; it returns e to the power ARG.

– Function: log arg &optional base
     This function returns the logarithm of ARG, with base BASE. If you
     don’t specify BASE, the natural base e is used. If ARG or BASE is
     negative, ‘log’ returns a NaN.

– Function: expt x y
     This function returns X raised to power Y. If both arguments are
     integers and Y is nonnegative, the result is an integer; in this
     case, overflow signals an error, so watch out. If X is a finite
     negative number and Y is a finite non-integer, ‘expt’ returns a
     NaN.

– Function: sqrt arg
     This returns the square root of ARG. If ARG is finite and less
     than zero, ‘sqrt’ returns a NaN.

   In addition, Emacs defines the following common mathematical
constants:

– Variable: float-e
     The mathematical constant e (2.71828…).

– Variable: float-pi
     The mathematical constant pi (3.14159…).

7.10. Random numbers

;; choose a seed based on the string contents
(random "")
;; If LIMIT is ‘t’, it means to choose a new seed as if Emacs were
;; restarting, typically from the system entropy.  On systems lacking
;; entropy pools, choose the seed from less-random volatile data such
;; as the current time.
(random t)

;; If LIMIT is a positive fixnum, the value is chosen to be
;; nonnegative and less than LIMIT.
(random 100)
;; Otherwise, the value might be any 
;; fixnum, i.e., any integer from ‘most-negative-fixnum’ through
;; ‘most-positive-fixnum’.
(random -100)
;; so the following two have the same effect?
(random -100)
(random)

8. String and Character Basics

In Emacs Lisp, characters are simply integers; whether an integer is a character or not is determined only by how it is used.

(aref "abc" 0) ;; 97 (#o141, #x61, ?a)

8.1. Predicates for Strings

– Function: stringp object
    This function returns ‘t’ if OBJECT is a string, ‘nil’ otherwise.

– Function: string-or-null-p object
    This function returns ‘t’ if OBJECT is a string or ‘nil’. It
    returns ‘nil’ otherwise.

– Function: char-or-string-p object
    This function returns ‘t’ if OBJECT is a string or a character
    (i.e., an integer), ‘nil’ otherwise.

(char-or-string-p 97) ;; t, because this is character a
(char-or-string-p "a") ;; t

8.2. Creating Strings

List of useful functions.

– Function: make-string count character &optional multibyte
    This function returns a string made up of COUNT repetitions of
    CHARACTER. If COUNT is negative, an error is signaled.

  (make-string 5 ?x)
       ⇒ "xxxxx"
  (make-string 0 ?x)
       ⇒ ""

    Normally, if CHARACTER is an ASCII character, the result is a
    unibyte string. But if the optional argument MULTIBYTE is
    non-‘nil’, the function will produce a multibyte string instead.
    This is useful when you later need to concatenate the result with
    non-ASCII strings or replace some of its characters with non-ASCII
    characters.

    Other functions to compare with this one include ‘make-vector’
    (*note Vectors::) and ‘make-list’ (*note Building Lists::).

– Function: string &rest characters
    This returns a string containing the characters CHARACTERS.

  (string ?a ?b ?c)
       ⇒ "abc"

– Function: substring string &optional start end
    This function returns a new string which consists of those
    characters from STRING in the range from (and including) the
    character at the index START up to (but excluding) the character at
    the index END. The first character is at index zero. With one
    argument, this function just copies STRING.

  (substring "abcdefg" 0 3)
       ⇒ "abc"

    In the above example, the index for ‘a’ is 0, the index for ‘b’ is
    1, and the index for ‘c’ is 2. The index 3—which is the fourth
    character in the string—marks the character position up to which
    the substring is copied. Thus, ‘abc’ is copied from the string
    ‘"abcdefg"’.

    A negative number counts from the end of the string, so that −1
    signifies the index of the last character of the string. For
    example:

  (substring "abcdefg" -3 -1)
       ⇒ "ef"

    In this example, the index for ‘e’ is −3, the index for ‘f’ is −2,
    and the index for ‘g’ is −1. Therefore, ‘e’ and ‘f’ are included,
    and ‘g’ is excluded.

    When ‘nil’ is used for END, it stands for the length of the string.
    Thus,

  (substring "abcdefg" -3 nil)
       ⇒ "efg"

    Omitting the argument END is equivalent to specifying ‘nil’. It
    follows that ‘(substring STRING 0)’ returns a copy of all of
    STRING.

  (substring "abcdefg" 0)
       ⇒ "abcdefg"

    But we recommend ‘copy-sequence’ for this purpose (*note Sequence
    Functions::).

    If the characters copied from STRING have text properties, the
    properties are copied into the new string also. *Note Text
    Properties::.

    ‘substring’ also accepts a vector for the first argument. For
    example:

  (substring [a b (c) "d"] 1 3)
       ⇒ [b (c)]

    A ‘wrong-type-argument’ error is signaled if START is not an
    integer or if END is neither an integer nor ‘nil’. An
    ‘args-out-of-range’ error is signaled if START indicates a
    character following END, or if either integer is out of range for
    STRING.

    Contrast this function with ‘buffer-substring’ (*note Buffer
    Contents::), which returns a string containing a portion of the
    text in the current buffer. The beginning of a string is at index
    0, but the beginning of a buffer is at index 1.

– Function: substring-no-properties string &optional start end
    This works like ‘substring’ but discards all text properties from
    the value. Also, START may be omitted or ‘nil’, which is
    equivalent to 0. Thus, ‘(substring-no-properties STRING)’ returns
    a copy of STRING, with all text properties removed.

– Function: concat &rest sequences
    This function returns a string consisting of the characters in the
    arguments passed to it (along with their text properties, if any).
    The arguments may be strings, lists of numbers, or vectors of
    numbers; they are not themselves changed. If ‘concat’ receives no
    arguments, it returns an empty string.

  (concat "abc" "-def")
       ⇒ "abc-def"
  (concat "abc" (list 120 121) [122])
       ⇒ "abcxyz"
  ;; ‘nil’ is an empty sequence.
  (concat "abc" nil "-def")
       ⇒ "abc-def"
  (concat "The " "quick brown " "fox.")
       ⇒ "The quick brown fox."
  (concat)
       ⇒ ""

    This function does not always allocate a new string. Callers are
    advised not rely on the result being a new string nor on it being
    ‘eq’ to an existing string.

    In particular, mutating the returned value may inadvertently change
    another string, alter a constant string in the program, or even
    raise an error. To obtain a string that you can safely mutate, use
    ‘copy-sequence’ on the result.

    For information about other concatenation functions, see the
    description of ‘mapconcat’ in *note Mapping Functions::, ‘vconcat’
    in *note Vector Functions::, and ‘append’ in *note Building
    Lists::. For concatenating individual command-line arguments into
    a string to be used as a shell command, see *note
    combine-and-quote-strings: Shell Arguments.

– Function: split-string string &optional separators omit-nulls trim
    This function splits STRING into substrings based on the regular
    expression SEPARATORS (*note Regular Expressions::). Each match
    for SEPARATORS defines a splitting point; the substrings between
    splitting points are made into a list, which is returned.

    If SEPARATORS is ‘nil’ (or omitted), the default is the value of
    ‘split-string-default-separators’ and the function behaves as if
    OMIT-NULLS were ‘t’.

    If OMIT-NULLS is ‘nil’ (or omitted), the result contains null
    strings whenever there are two consecutive matches for SEPARATORS,
    or a match is adjacent to the beginning or end of STRING. If
    OMIT-NULLS is ‘t’, these null strings are omitted from the result.

    If the optional argument TRIM is non-‘nil’, it should be a regular
    expression to match text to trim from the beginning and end of each
    substring. If trimming makes the substring empty, it is treated as
    null.

    If you need to split a string into a list of individual
    command-line arguments suitable for ‘call-process’ or
    ‘start-process’, see *note split-string-and-unquote: Shell
    Arguments.

    Examples:

  (split-string " two words ")
       ⇒ ("two" "words")

    The result is not ‘("" "two" "words" "")’, which would rarely be
    useful. If you need such a result, use an explicit value for
    SEPARATORS:

  (split-string " two words "
         split-string-default-separators)
       ⇒ ("" "two" "words" "")

  (split-string "Soup is good food" "o")
       ⇒ ("S" "up is g" "" "d f" "" "d")
  (split-string "Soup is good food" "o" t)
       ⇒ ("S" "up is g" "d f" "d")
  (split-string "Soup is good food" "o+")
       ⇒ ("S" "up is g" "d f" "d")

    Empty matches do count, except that ‘split-string’ will not look
    for a final empty match when it already reached the end of the
    string using a non-empty match or when STRING is empty:

  (split-string "aooob" "o*")
       ⇒ ("" "a" "" "b" "")
  (split-string "ooaboo" "o*")
       ⇒ ("" "" "a" "b" "")
  (split-string "" "")
       ⇒ ("")

    However, when SEPARATORS can match the empty string, OMIT-NULLS is
    usually ‘t’, so that the subtleties in the three previous examples
    are rarely relevant:

  (split-string "Soup is good food" "o*" t)
       ⇒ ("S" "u" "p" " " "i" "s" " " "g" "d" " " "f" "d")
  (split-string "Nice doggy!" "" t)
       ⇒ ("N" "i" "c" "e" " " "d" "o" "g" "g" "y" "!")
  (split-string "" "" t)
       ⇒ nil

    Somewhat odd, but predictable, behavior can occur for certain
    “non-greedy” values of SEPARATORS that can prefer empty matches
    over non-empty matches. Again, such values rarely occur in
    practice:

  (split-string "ooo" "o*" t)
       ⇒ nil
  (split-string "ooo" "\\|o+" t)
       ⇒ ("o" "o" "o")

– Variable: split-string-default-separators
    The default value of SEPARATORS for ‘split-string’. Its usual
    value is ‘"[ \f\t\n\r\v]+"’.

8.3. Modifying strings

Useful functions.

(setq foo "abc")
(aset foo 1 ?c) ;; c
(message foo) ;; acc


(setq foo "abc")
(store-substring foo 0 "bb") ;; bbc
(message foo) ;; bbc


(setq foo "abc")
(clear-string foo) ;; nil

8.4. Comparison of Characters and Strings

char-equal.

(char-equal ?x ?x) ;; t
(char-equal ?x ?X) ;; t

;; should not ignore case
;; `case-fold-search`
(let ((case-fold-search nil))
  (char-equal ?x ?X)) ;; nil

string=

(string= "abc" "abc") ;; t

;; Case is always significant
;; regardless of `case-fold-search`
(string= "abc" "Abc") ;; nil


;; Symbols are also allowed as arguments, in which
;; case the symbol names are used.
(setq foo "abc")
(string= foo "abc") ;; t

;; create a symbol and assign "test" to its name cell.
(setq sym-test-random-var-name (make-symbol "test"))
(string= sym-test-random-var-name "test") ;; t

;; alias of string=
(string-equal "a" "a")

;; compares two strings using lexicographic order
(string< "abc" "abd") ;; t
(string< "abd" "abd") ;; nil
(string< "abe" "abd") ;; nil
(string< "be" "abd") ;; nil
;; alias for string<
(string-lessp "a" "b") ;; t
(string-greaterp "a" "b") ;; nil

(string-prefix-p "a" "ab") ;; t
(string-prefix-p "A" "ab") ;; nil
;; do not ignore case
(string-prefix-p "A" "ab" nil) ;; nil
(string-prefix-p "A" "ab" t) ;; t

(string-suffix-p "a" "ba") ;; t
(string-suffix-p "A" "ba") ;; nil
;; do not ignore case
(string-suffix-p "A" "ba" nil) ;; nil
(string-suffix-p "A" "ba" t) ;; t


;; “Levenshtein distance”
(string-distance "ab" "abc") ;; 1
(string-distance "Ab" "abc") ;; 2

Other functions.

– Function: string-collate-lessp string1 string2 &optional locale
          ignore-case
     This function returns ‘t’ if STRING1 is less than STRING2 in
     collation order. A collation order is not only determined by the
     lexicographic order of the characters contained in STRING1 and
     STRING2, but also further rules about relations between these
     characters. Usually, it is defined by the LOCALE environment Emacs
     is running with.

     For example, punctuation and whitespace characters might be ignored
     for sorting (*note Sequence Functions::):

          (sort (list "11" "12" "1 1" "1 2" "1.1" "1.2") 'string-collate-lessp)
               ⇒ ("11" "1 1" "1.1" "12" "1 2" "1.2")

     This behavior is system-dependent; e.g., punctuation and whitespace
     are never ignored on Cygwin, regardless of locale.

     The optional argument LOCALE, a string, overrides the setting of
     your current locale identifier for collation. The value is system
     dependent; a LOCALE ‘"en_US.UTF-8"’ is applicable on POSIX systems,
     while it would be, e.g., ‘"enu_USA.1252"’ on MS-Windows systems.
     The LOCALE value of ‘"POSIX"’ or ‘"C"’ lets ‘string-collate-lessp’
     behave like ‘string-lessp’:

          (sort (list "11" "12" "1 1" "1 2" "1.1" "1.2")
                (lambda (s1 s2) (string-collate-lessp s1 s2 "POSIX")))
               ⇒ ("1 1" "1 2" "1.1" "1.2" "11" "12")

     If IGNORE-CASE is non-‘nil’, characters are converted to lower-case
     before comparing them.

     To emulate Unicode-compliant collation on MS-Windows systems, bind
     ‘w32-collate-ignore-punctuation’ to a non-‘nil’ value, since the
     codeset part of the locale cannot be ‘"UTF-8"’ on MS-Windows.

     If your system does not support a locale environment, this function
     behaves like ‘string-lessp’.

– Function: string-version-lessp string1 string2
     This function compares strings lexicographically, except it treats
     sequences of numerical characters as if they comprised a base-ten
     number, and then compares the numbers. So ‘foo2.png’ is “smaller”
     than ‘foo12.png’ according to this predicate, even if ‘12’ is
     lexicographically “smaller” than ‘2’.

– Function: compare-strings string1 start1 end1 string2 start2 end2
          &optional ignore-case
     This function compares a specified part of STRING1 with a specified
     part of STRING2. The specified part of STRING1 runs from index
     START1 (inclusive) up to index END1 (exclusive); ‘nil’ for START1
     means the start of the string, while ‘nil’ for END1 means the
     length of the string. Likewise, the specified part of STRING2 runs
     from index START2 up to index END2.

     The strings are compared by the numeric values of their characters.
     For instance, STR1 is considered less than STR2 if its first
     differing character has a smaller numeric value. If IGNORE-CASE is
     non-‘nil’, characters are converted to upper-case before comparing
     them. Unibyte strings are converted to multibyte for comparison
     (*note Text Representations::), so that a unibyte string and its
     conversion to multibyte are always regarded as equal.

     If the specified portions of the two strings match, the value is
     ‘t’. Otherwise, the value is an integer which indicates how many
     leading characters agree, and which string is less. Its absolute
     value is one plus the number of characters that agree at the
     beginning of the two strings. The sign is negative if STRING1 (or
     its specified portion) is less.
– Function: assoc-string key alist &optional case-fold
     This function works like ‘assoc’, except that KEY must be a string
     or symbol, and comparison is done using ‘compare-strings’. Symbols
     are converted to strings before testing. If CASE-FOLD is
     non-‘nil’, KEY and the elements of ALIST are converted to
     upper-case before comparison. Unlike ‘assoc’, this function can
     also match elements of the alist that are strings or symbols rather
     than conses. In particular, ALIST can be a list of strings or
     symbols rather than an actual alist. *Note Association Lists::.

8.5. Conversion of Characters and Strings

(prin1-to-string (list 'a 'b 'c))

(number-to-string 256) ;; "256"
(string-to-number "25") ;; 25
(string-to-number "25f") ;; 25
(string-to-number "25 is a number") ;; 25
(string-to-number "f25") ;; 0

;; This function is semi-obsolete because the function
;; ‘string’ is more general.
(char-to-string ?a) ;; "a"
(string ?a) ;; "a"

;; This function returns the first character in STRING.  This mostly
;; identical to ‘(aref string 0)’, except that it returns 0 if the
;; string is empty.
(string-to-char "abc") ;; 97

Other useful functions.

‘concat’
     This function converts a vector or a list into a string. *Note
     Creating Strings::.

‘vconcat’
     This function converts a string into a vector. *Note Vector
     Functions::.

‘append’
     This function converts a string into a list. *Note Building
     Lists::.

‘byte-to-string’
     This function converts a byte of character data into a unibyte
     string. *Note Converting Representations::.

8.6. Formatting Strings

Keeping this part here more for reference.

“Formatting” means constructing a string by substituting computed values
at various places in a constant string. This constant string controls
how the other values are printed, as well as where they appear; it is
called a “format string”.

   Formatting is often useful for computing messages to be displayed.
In fact, the functions ‘message’ and ‘error’ provide the same formatting
feature described here; they differ from ‘format-message’ only in how
they use the result of formatting.

– Function: format string &rest objects
     This function returns a string equal to STRING, replacing any
     format specifications with encodings of the corresponding OBJECTS.
     The arguments OBJECTS are the computed values to be formatted.

     The characters in STRING, other than the format specifications, are
     copied directly into the output, including their text properties,
     if any. Any text properties of the format specifications are
     copied to the produced string representations of the argument
     OBJECTS.

     The output string need not be newly-allocated. For example, if ‘x’
     is the string ‘"foo"’, the expressions ‘(eq x (format x))’ and ‘(eq
     x (format "%s" x))’ might both yield ‘t’.

– Function: format-message string &rest objects
     This function acts like ‘format’, except it also converts any grave
     accents (`) and apostrophes (') in STRING as per the value of
     ‘text-quoting-style’.

     Typically grave accent and apostrophe in the format translate to
     matching curved quotes, e.g., "Missing `%s'" might result in
     "Missing ‘foo’". *Note Text Quoting Style::, for how to influence
     or inhibit this translation.

   A format specification is a sequence of characters beginning with a
‘%’. Thus, if there is a ‘%d’ in STRING, the ‘format’ function replaces
it with the printed representation of one of the values to be formatted
(one of the arguments OBJECTS). For example:

     (format "The value of fill-column is %d." fill-column)
          ⇒ "The value of fill-column is 72."

   Since ‘format’ interprets ‘%’ characters as format specifications,
you should never pass an arbitrary string as the first argument. This
is particularly true when the string is generated by some Lisp code.
Unless the string is known to never include any ‘%’ characters, pass
‘"%s"’, described below, as the first argument, and the string as the
second, like this:

       (format "%s" ARBITRARY-STRING)

   Certain format specifications require values of particular types. If
you supply a value that doesn’t fit the requirements, an error is
signaled.

   Here is a table of valid format specifications:

‘%s’
     Replace the specification with the printed representation of the
     object, made without quoting (that is, using ‘princ’, not
     ‘prin1’—*note Output Functions::). Thus, strings are represented
     by their contents alone, with no ‘"’ characters, and symbols appear
     without ‘\’ characters.

     If the object is a string, its text properties are copied into the
     output. The text properties of the ‘%s’ itself are also copied,
     but those of the object take priority.

‘%S’
     Replace the specification with the printed representation of the
     object, made with quoting (that is, using ‘prin1’—*note Output
     Functions::). Thus, strings are enclosed in ‘"’ characters, and
     ‘\’ characters appear where necessary before special characters.

‘%o’
     Replace the specification with the base-eight representation of an
     integer. Negative integers are formatted in a platform-dependent
     way. The object can also be a floating-point number that is
     formatted as an integer, dropping any fraction.

‘%d’
     Replace the specification with the base-ten representation of a
     signed integer. The object can also be a floating-point number
     that is formatted as an integer, dropping any fraction.

‘%x’
‘%X’
     Replace the specification with the base-sixteen representation of
     an integer. Negative integers are formatted in a
     platform-dependent way. ‘%x’ uses lower case and ‘%X’ uses upper
     case. The object can also be a floating-point number that is
     formatted as an integer, dropping any fraction.

‘%c’
     Replace the specification with the character which is the value
     given.

‘%e’
     Replace the specification with the exponential notation for a
     floating-point number.

‘%f’
     Replace the specification with the decimal-point notation for a
     floating-point number.

‘%g’
     Replace the specification with notation for a floating-point
     number, using either exponential notation or decimal-point
     notation. The exponential notation is used if the exponent would
     be less than −4 or greater than or equal to the precision (default:
     6). By default, trailing zeros are removed from the fractional
     portion of the result and a decimal-point character appears only if
     it is followed by a digit.

‘%%’
     Replace the specification with a single ‘%’. This format
     specification is unusual in that its only form is plain ‘%%’ and
     that it does not use a value. For example, ‘(format "%% %d" 30)’
     returns ‘"% 30"’.

   Any other format character results in an ‘Invalid format operation’
error.

   Here are several examples, which assume the typical
‘text-quoting-style’ settings:

     (format "The octal value of %d is %o,
              and the hex value is %x." 18 18 18)
          ⇒ "The octal value of 18 is 22,
              and the hex value is 12."

     (format-message
      "The name of this buffer is ‘%s’." (buffer-name))
          ⇒ "The name of this buffer is ‘strings.texi’."

     (format-message
      "The buffer object prints as `%s'." (current-buffer))
          ⇒ "The buffer object prints as ‘strings.texi’."

   By default, format specifications correspond to successive values
from OBJECTS. Thus, the first format specification in STRING uses the
first such value, the second format specification uses the second such
value, and so on. Any extra format specifications (those for which
there are no corresponding values) cause an error. Any extra values to
be formatted are ignored.

   A format specification can have a “field number”, which is a decimal
number immediately after the initial ‘%’, followed by a literal dollar
sign ‘$’. It causes the format specification to convert the argument
with the given number instead of the next argument. Field numbers start
at 1. A format can contain either numbered or unnumbered format
specifications but not both, except that ‘%%’ can be mixed with numbered
specifications.

     (format "%2$s, %3$s, %%, %1$s" "x" "y" "z")
          ⇒ "y, z, %, x"

   After the ‘%’ and any field number, you can put certain “flag
characters”.

   The flag ‘+’ inserts a plus sign before a nonnegative number, so that
it always has a sign. A space character as flag inserts a space before
a nonnegative number. (Otherwise, nonnegative numbers start with the
first digit.) These flags are useful for ensuring that nonnegative and
negative numbers use the same number of columns. They are ignored
except for ‘%d’, ‘%e’, ‘%f’, ‘%g’, and if both flags are used, ‘+’ takes
precedence.

   The flag ‘#’ specifies an alternate form which depends on the format
in use. For ‘%o’, it ensures that the result begins with a ‘0’. For
‘%x’ and ‘%X’, it prefixes nonzero results with ‘0x’ or ‘0X’. For ‘%e’
and ‘%f’, the ‘#’ flag means include a decimal point even if the
precision is zero. For ‘%g’, it always includes a decimal point, and
also forces any trailing zeros after the decimal point to be left in
place where they would otherwise be removed.

   The flag ‘0’ ensures that the padding consists of ‘0’ characters
instead of spaces. This flag is ignored for non-numerical specification
characters like ‘%s’, ‘%S’ and ‘%c’. These specification characters
accept the ‘0’ flag, but still pad with spaces.

   The flag ‘-’ causes any padding inserted by the width, if specified,
to be inserted on the right rather than the left. If both ‘-’ and ‘0’
are present, the ‘0’ flag is ignored.

     (format "%06d is padded on the left with zeros" 123)
          ⇒ "000123 is padded on the left with zeros"

     (format "'%-6d' is padded on the right" 123)
          ⇒ "'123 ' is padded on the right"

     (format "The word '%-7s' actually has %d letters in it."
             "foo" (length "foo"))
          ⇒ "The word 'foo ' actually has 3 letters in it."

   A specification can have a “width”, which is a decimal number that
appears after any field number and flags. If the printed representation
of the object contains fewer characters than this width, ‘format’
extends it with padding. Any padding introduced by the width normally
consists of spaces inserted on the left:

     (format "%5d is padded on the left with spaces" 123)
          ⇒ " 123 is padded on the left with spaces"

If the width is too small, ‘format’ does not truncate the object’s
printed representation. Thus, you can use a width to specify a minimum
spacing between columns with no risk of losing information. In the
following two examples, ‘%7s’ specifies a minimum width of 7. In the
first case, the string inserted in place of ‘%7s’ has only 3 letters,
and needs 4 blank spaces as padding. In the second case, the string
‘"specification"’ is 13 letters wide but is not truncated.

     (format "The word '%7s' has %d letters in it."
             "foo" (length "foo"))
          ⇒ "The word ' foo' has 3 letters in it."
     (format "The word '%7s' has %d letters in it."
             "specification" (length "specification"))
          ⇒ "The word 'specification' has 13 letters in it."

   All the specification characters allow an optional “precision” after
the field number, flags and width, if present. The precision is a
decimal-point ‘.’ followed by a digit-string. For the floating-point
specifications (‘%e’ and ‘%f’), the precision specifies how many digits
following the decimal point to show; if zero, the decimal-point itself
is also omitted. For ‘%g’, the precision specifies how many significant
digits to show (significant digits are the first digit before the
decimal point and all the digits after it). If the precision of %g is
zero or unspecified, it is treated as 1. For ‘%s’ and ‘%S’, the
precision truncates the string to the given width, so ‘%.3s’ shows only
the first three characters of the representation for OBJECT. For other
specification characters, the effect of precision is what the local
library functions of the ‘printf’ family produce.

   If you plan to use ‘read’ later on the formatted string to retrieve a
copy of the formatted value, use a specification that lets ‘read’
reconstruct the value. To format numbers in this reversible way you can
use ‘%s’ and ‘%S’, to format just integers you can also use ‘%d’, and to
format just nonnegative integers you can also use ‘#x%x’ and ‘#o%o’.
Other formats may be problematic; for example, ‘%d’ and ‘%g’ can
mishandle NaNs and can lose precision and type, and ‘#x%x’ and ‘#o%o’
can mishandle negative integers. *Note Input Functions::.

   The functions described in this section accept a fixed set of
specification characters. The next section describes a function
‘format-spec’ which can accept custom specification characters, such as
‘%a’ or ‘%z’.

8.7. Custom Format Strings

This introduced format-spec that performs a similar function to ‘format’, except it operates on format control strings that use arbitrary specification characters.

(setq my-site-info (list (cons ?s system-name) 
                                                 (cons ?t (symbol-name system-type)) 
                                                 (cons ?c system-configuration) 
                                                 (cons ?v emacs-version) 
                                                 (cons ?e invocation-name) 
                                                 (cons ?p (number-to-string (emacs-pid))) 
                                                 (cons ?a user-mail-address) 
                                                 (cons ?n user-full-name)))
(format-spec "%e %v (%c)" my-site-info) ;; "Emacs 27.2 (x86_64-apple-darwin19.6.0)"
(format-spec "%n <%a>" my-site-info) ;; "Yi Wang <[email protected]>"

   A format specification can include any number of the following flag
characters immediately after the ‘%’ to modify aspects of the
substitution.

‘0’
     This flag causes any padding specified by the width to consist of
     ‘0’ characters instead of spaces.

‘-’
     This flag causes any padding specified by the width to be inserted
     on the right rather than the left.

‘<’
     This flag causes the substitution to be truncated on the left to
     the given width, if specified.

‘>’
     This flag causes the substitution to be truncated on the right to
     the given width, if specified.

‘^’
     This flag converts the substituted text to upper case (*note Case
     Conversion::).

‘_’
     This flag converts the substituted text to lower case (*note Case
     Conversion::).

   The result of using contradictory flags (for instance, both upper and
lower case) is undefined.

   As is the case with ‘format’, a format specification can include a
width, which is a decimal number that appears after any flags. If a
substitution contains fewer characters than its specified width, it is
padded on the left:

     (format-spec "%8a is padded on the left with spaces"
                  '((?a . "alpha")))
          ⇒ " alpha is padded on the left with spaces"

Here is a more complicated example that combines several aforementioned features:

(setq my-battery-info (list (cons ?p "73")      ; Percentage
                                                        (cons ?L "Battery") ; Status
                                                        (cons ?t "2:23")    ; Remaining time
                                                        (cons ?c "24330")   ; Capacity
                                                        (cons ?r "10.6")))  ; Rate of discharge

(format-spec "%>^-3L : %3p%% (%05t left)" my-battery-info)

(format-spec "%>^-3L : %3p%% (%05t left)"
            (cons (cons ?L "AC")
                my-battery-info))

8.8. Case Conversion in Lisp

The character case functions change the case of single characters or of the contents of strings. The functions normally convert only alphabetic characters (the letters ‘A’ through ‘Z’ and ‘a’ through ‘z’, as well as non-ASCII letters); other characters are not altered.

(downcase ?X) ;; 120
(upcase ?x) ;; 88
(capitalize "test the function") ;; Test The Function
(capitalize "TEST THE FUNCTION") ;; Test The Function
(upcase-initials "test THE function") ;; Test THE Function

Note that case conversion is not a one-to-one mapping of codepoints
and length of the result may differ from length of the argument.
Furthermore, because passing a character forces return type to be a
character, functions are unable to perform proper substitution and
result may differ compared to treating a one-character string. For
example:

     (upcase "ﬁ") ; note: single character, ligature "fi"
          ⇒ "FI"
     (upcase ?ﬁ)
          ⇒ 64257 ; i.e. ?ﬁ

   To avoid this, a character must first be converted into a string,
using ‘string’ function, before being passed to one of the casing
functions. Of course, no assumptions on the length of the result may be
made.

   Mapping for such special cases are taken from ‘special-uppercase’,
‘special-lowercase’ and ‘special-titlecase’ *Note Character
Properties::.

   *Note Text Comparison::, for functions that compare strings; some of
them ignore case differences, or can optionally ignore case differences.

8.9. The Case Table

This is not very related because I only use English and Chinese at the moment. I'll only check this when the need arises.

You can customize case conversion by installing a special “case table”.
A case table specifies the mapping between upper case and lower case
letters. It affects both the case conversion functions for Lisp objects
(see the previous section) and those that apply to text in the buffer
(*note Case Changes::). Each buffer has a case table; there is also a
standard case table which is used to initialize the case table of new
buffers.

   A case table is a char-table (*note Char-Tables::) whose subtype is
‘case-table’. This char-table maps each character into the
corresponding lower case character. It has three extra slots, which
hold related tables:

UPCASE
     The upcase table maps each character into the corresponding upper
     case character.
CANONICALIZE
     The canonicalize table maps all of a set of case-related characters
     into a particular member of that set.
EQUIVALENCES
     The equivalences table maps each one of a set of case-related
     characters into the next character in that set.

   In simple cases, all you need to specify is the mapping to
lower-case; the three related tables will be calculated automatically
from that one.

   For some languages, upper and lower case letters are not in
one-to-one correspondence. There may be two different lower case
letters with the same upper case equivalent. In these cases, you need
to specify the maps for both lower case and upper case.

   The extra table CANONICALIZE maps each character to a canonical
equivalent; any two characters that are related by case-conversion have
the same canonical equivalent character. For example, since ‘a’ and ‘A’
are related by case-conversion, they should have the same canonical
equivalent character (which should be either ‘a’ for both of them, or
‘A’ for both of them).

   The extra table EQUIVALENCES is a map that cyclically permutes each
equivalence class (of characters with the same canonical equivalent).
(For ordinary ASCII, this would map ‘a’ into ‘A’ and ‘A’ into ‘a’, and
likewise for each set of equivalent characters.)

   When constructing a case table, you can provide ‘nil’ for
CANONICALIZE; then Emacs fills in this slot from the lower case and
upper case mappings. You can also provide ‘nil’ for EQUIVALENCES; then
Emacs fills in this slot from CANONICALIZE. In a case table that is
actually in use, those components are non-‘nil’. Do not try to specify
EQUIVALENCES without also specifying CANONICALIZE.

   Here are the functions for working with case tables:

– Function: case-table-p object
     This predicate returns non-‘nil’ if OBJECT is a valid case table.

– Function: set-standard-case-table table
     This function makes TABLE the standard case table, so that it will
     be used in any buffers created subsequently.

– Function: standard-case-table
     This returns the standard case table.

– Function: current-case-table
     This function returns the current buffer’s case table.

– Function: set-case-table table
     This sets the current buffer’s case table to TABLE.

– Macro: with-case-table table body…
     The ‘with-case-table’ macro saves the current case table, makes
     TABLE the current case table, evaluates the BODY forms, and finally
     restores the case table. The return value is the value of the last
     form in BODY. The case table is restored even in case of an
     abnormal exit via ‘throw’ or error (*note Nonlocal Exits::).

   Some language environments modify the case conversions of ASCII
characters; for example, in the Turkish language environment, the ASCII
capital I is downcased into a Turkish dotless i (‘ı’). This can
interfere with code that requires ordinary ASCII case conversion, such
as implementations of ASCII-based network protocols. In that case, use
the ‘with-case-table’ macro with the variable ASCII-CASE-TABLE, which
stores the unmodified case table for the ASCII character set.

– Variable: ascii-case-table
     The case table for the ASCII character set. This should not be
     modified by any language environment settings.

   The following three functions are convenient subroutines for packages
that define non-ASCII character sets. They modify the specified case
table CASE-TABLE; they also modify the standard syntax table. *Note
Syntax Tables::. Normally you would use these functions to change the
standard case table.

– Function: set-case-syntax-pair uc lc case-table
     This function specifies a pair of corresponding letters, one upper
     case and one lower case.

– Function: set-case-syntax-delims l r case-table
     This function makes characters L and R a matching pair of
     case-invariant delimiters.

– Function: set-case-syntax char syntax case-table
     This function makes CHAR case-invariant, with syntax SYNTAX.

– Command: describe-buffer-case-table
     This command displays a description of the contents of the current
     buffer’s case table.

9. Lists

9.1. Lists and Cons Cells

A list is a series of cons cells chained together, so that each cell
refers to the next one. There is one cons cell for each element of the
list. By convention, the CARs of the cons cells hold the elements of
the list, and the CDRs are used to chain the list (this asymmetry
between CAR and CDR is entirely a matter of convention; at the level of
cons cells, the CAR and CDR slots have similar properties). Hence, the
CDR slot of each cons cell in a list refers to the following cons cell.

Also by convention, the CDR of the last cons cell in a list is ‘nil’. We call such a ‘nil’-terminated structure a “proper list”. In Emacs Lisp, the symbol ‘nil’ is both a symbol and a list with no elements. For convenience, the symbol ‘nil’ is considered to have ‘nil’ as its CDR (and also as its CAR). The CDR of a proper list is always a proper list. The CDR of a nonempty proper list is a proper list containing all the elements except the first.

9.2. Predicates on Lists

;; This function returns ‘t’ if OBJECT is a cons cell, ‘nil’ otherwise. 
(consp '('test . 'random)) ;; t
;; ‘nil’ is not a cons cell, although it _is_ a list.
(consp nil) ;; nil

;; This function returns ‘t’ if OBJECT is an atom, ‘nil’ otherwise.
(atom "t") ;; t
(atom 1) ;; t
;; All objects except cons cells are atoms.  The symbol ‘nil’ is an
;; atom and is also a list; it is the only Lisp object that is both.
(atom nil) ;; t
(listp nil) ;; t
;; (atom OBJECT) ≡ (not (consp OBJECT))


;; This function returns ‘t’ if OBJECT is a cons cell or ‘nil’.
;; Otherwise, it returns ‘nil’.
(listp nil) ;; t
(listp '(1)) ;; t
(listp '()) ;; t


;; This function is the opposite of ‘listp’: it returns ‘t’ if OBJECT
;; is not a list.  Otherwise, it returns ‘nil’.
(nlistp '(1)) ;; nil
(nlistp nil) ;; nil
;; (listp OBJECT) ≡ (not (nlistp OBJECT))


;; This function returns ‘t’ if OBJECT is ‘nil’, and returns ‘nil’ otherwise
;; This function is identical to ‘not’, but as a matter of
;; clarity we use ‘null’ when OBJECT is considered a list and ‘not’
;; when it is considered a truth value
(null nil) ;; t


;; This function returns the length of OBJECT if it is a proper list,
;; 'nil' otherwise
(proper-list-p '(a b c)) ;; 3
(proper-list-p '(a b . c)) ;; nil

9.3. Accessing Elements of Lists

Basic operations.

(car '((a . b)  c)) ;; (a . b)
(car nil)           ;; nil
(car '())           ;; nil

(cdr '((a . b)  c)) ;; (c)
(cdr nil)           ;; nil
(cdr '())           ;; nil

(car-safe 1) ;; nil
(car-safe '(a b)) ;; a
(listp (car-safe '(a b))) ;; nil

;; (car-safe OBJECT)
;; ≡
;; (let ((x OBJECT))
;;   (if (consp x) 
;;       (car x) nil))

(cdr-safe 1) ;; nil
(cdr-safe '(a b)) ;; (b)
(listp (cdr-safe '(a b))) ;; t
;; (cdr-safe OBJECT)
;; ≡
;; (let ((x OBJECT))
;;   (if (consp x) 
;;       (cdr x) nil))

Note the following about car and cdr. This is because for the object '(a b), its CAR is an atom a, while its CDR is always list whose CAR is b.

(listp (car '(a b))) ;; nil

(listp (cdr '(a b))) ;; t

(listp (car (cdr '(a b)))) ;; nil

(listp (cdr '(a . b))) ;; nil

Continuation of basic operations.

(setq x (list 1 "test" ?a))
;; This is a macro
;; This macro provides a convenient way to examine the CAR of a list,
;; and take it off the list, all at once.  It operates on the list
;; stored in LISTNAME.  It removes the first element from the list,
;; saves the CDR into LISTNAME, then returns the removed element.
;; 
;; In the simplest case, LISTNAME is an unquoted symbol naming a list;
;; in that case, this macro is equivalent to
;; ‘(prog1 (car listname) (setq listname (cdr listname)))’.

(pop x) ;; 1
(progn x) ;; ("test" 97)

(setq x (list 1 "test" ?a))

;; This function returns the Nth element of LIST.  Elements are
;; numbered starting with zero, so the CAR of LIST is element number
;; zero.  If the length of LIST is N or less, the value is ‘nil’.
(nth 0 x) ;; 1
(nth 1000 x) ;; nil
;; (nth n x) ≡ (car (nthcdr n x))


;; This function returns the Nth CDR of LIST.  In other words, it
;; skips past the first N links of LIST and returns what follows.

;; If N is zero, ‘nthcdr’ returns all of LIST.  If the length of LIST
;; is N or less, ‘nthcdr’ returns ‘nil’.
(nthcdr 1 x) ;; ("test" 97)
(nthcdr 0 x) ;; (1 "test" 97)
(nthcdr 10000 x) ;; nil


;; This function returns the last link of LIST.  The ‘car’ of this
;; link is the list’s last element.  If LIST is null, ‘nil’ is
;; returned.  If N is non-‘nil’, the Nth-to-last link is returned
;; instead, or the whole of LIST if N is bigger than LIST’s length.
(last x) ;; (97)
(last x 0) ;; nil
(last x 1) ;; (97)
(last x 10) ;; (1 "test" 97)

;; This function returns the length of LIST, with no risk of either an
;; error or an infinite loop.  It generally returns the number of
;; distinct cons cells in the list.  However, for circular lists, the
;; value is just an upper bound; it is often too large.

;; If LIST is not ‘nil’ or a cons cell, ‘safe-length’ returns 0.
(safe-length x) ;; 3
;; only one cons cell
(safe-length '(a . b)) ;; 1
(safe-length 1) ;; 0

(setq deep-x `(,x (,x ,x)))
;; This is the same as ‘(car (car CONS-CELL))’.
(caar deep-x) ;; 1
;; This is the same as ‘(car (cdr CONS-CELL))’ or ‘(nth 1 CONS-CELL)’.
(cadr deep-x) ;; ((1 "test" 97) (1 "test" 97))

;; In addition to the above, 24 additional compositions of ‘car’ and
;; ‘cdr’ are defined as ‘cXXXr’ and ‘cXXXXr’, where each ‘X’ is either ‘a’
;; or ‘d’.  ‘cadr’, ‘caddr’, and ‘cadddr’ pick out the second, third or
;; fourth elements of a list, respectively.  ‘cl-lib’ provides the same

;; This function returns the list X with the last element, or the last
;; N elements, removed.
(butlast x) ;; (1 "test")
(butlast x 2) ;; (1)
(butlast x 10) ;; nil

;; This is a version of ‘butlast’ that works by destructively
;; modifying the ‘cdr’ of the appropriate element, rather than making
;; a copy of the list.
(nbutlast x) ;; (1 "test")

9.4. Building Cons Cells and Lists

(cons 1 '(2)) ;; (1  2), CAR holds 1 and CDR holds a list (2)
(cons 1 '()) ;; (1), CAR holds 1 and CDR holds nil
(cons 1 2) ;; (1 . 2), CAR holds 1 and CDR holds 2

;; ‘cons’ is often used to add a single element to the front of a
;; list.  This is called “consing the element onto the list”.  (1) For
;; example:

(setq list '(1))
(setq newelt 2)
(setq list (cons newelt list)) ;; (2 1)

;; This function creates a list with OBJECTS as its elements.  The
;; resulting list is always ‘nil’-terminated.  If no OBJECTS are
;; given, the empty list is returned.
(list 1 2 3 4)

;; This function creates a list of LENGTH elements, in which each
;; element is OBJECT.  Compare ‘make-list’ with ‘make-string’ (*note
;; Creating Strings::).
(make-list 3 'sym) ;; (sym sym sym)

9.4.1. Function: append &rest sequences

This function returns a list containing all the elements of SEQUENCES. The SEQUENCES may be lists, vectors, bool-vectors, or strings, but the last one should usually be a list. All arguments except the last one are copied, so none of the arguments is altered. (See ‘nconc’ in *note Rearrangement::, for a way to join lists with no copying.)

More generally, the final argument to ‘append’ may be any Lisp object. The final argument is not copied or converted; it becomes the CDR of the last cons cell in the new list. If the final argument is itself a list, then its elements become in effect elements of the result list. If the final element is not a list, the result is a dotted list since its final CDR is not ‘nil’ as required in a proper list (*note Cons Cells::).

(setq trees '(pine oak)) ;; (pine oak)

(setq more-trees (append '(maple birch) trees)) ;; (maple birch pine oak)
(eq trees (cddr more-trees)) ;; t

You can see how ‘append’ works by looking at a box diagram. The variable ‘trees’ is set to the list ‘(pine oak)’ and then the variable ‘more-trees’ is set to the list ‘(maple birch pine oak)’. However, the variable ‘trees’ continues to refer to the original list:

     more-trees                trees
     |                           |
     |     --- ---      --- ---   -> --- ---      --- ---
      --> |   |   |--> |   |   |--> |   |   |--> |   |   |--> nil
           --- ---      --- ---      --- ---      --- ---
            |            |            |            |
            |            |            |            |
             --> maple    -->birch     --> pine     --> oak

An empty sequence contributes nothing to the value returned by ‘append’. As a consequence of this, a final ‘nil’ argument forces a copy of the previous argument:

(setq trees '(pine oak)) ;; (pine oak)
(setq wood (append trees nil)) ;; (pine oak)
(eq wood trees) ;; nil
(eq (cdr wood) (cdr trees)) ;; nil

Work with Vectors.

(append [a b] "cd" nil) ;; (a b 99 100), append returns a list
(string-to-char "c") ;; 99
(string-to-char "d") ;; 100

Combined with apply.

(apply 'append '((a b c) nil (x y z) nil)) ;; (a b c x y z)

No arguments.

(append) ;; nil

Final argument is not a list.

(append '(x y) 'z) ;; (x y . z)
;; This example shows that when the final argument is a sequence but
;; not a list, the sequence’s elements do not become elements of the
;; resulting list.  Instead, the sequence becomes the final CDR, like any
;; other non-list final argument.
(append '(x y) [z]) ;; (x y . [z])

9.4.2. Function: copy-tree tree &optional vecp

This function returns a copy of the tree TREE. If TREE is a cons cell, this makes a new cons cell with the same CAR and CDR, then recursively copies the CAR and CDR in the same way.

Normally, when TREE is anything other than a cons cell, ‘copy-tree’ simply returns TREE. However, if VECP is non-‘nil’, it copies vectors too (and operates recursively on their elements).

(copy-tree '(x y)) ;; (x y)
(copy-tree '(x . y)) ;; (x . y)
(copy-tree '(x '(xx yy) y)) ;; (x '(xx yy) y)
;; TREE is not a cons cell
(setq tree "test")
(setq new-tree (copy-tree tree))
(eq tree "test") ;; nil
(eq tree new-tree) ;; t

;; turn on vecp
(setq tree [1 2 3])
;; simply return TREE
(setq new-tree (copy-tree tree)) ;; [1 2 3]
(eq tree new-tree) ;; t

;; copy TREE and return
(setq new-tree (copy-tree new-tree t)) ;; [1 2 3]
(eq tree new-tree) ;; nil

9.4.3. Function: flatten-tree tree

This function returns a “flattened” copy of TREE, that is, a list containing all the non-‘nil’ terminal nodes, or leaves, of the tree of cons cells rooted at TREE. Leaves in the returned list are in the same order as in TREE.

(flatten-tree '(1 (2 . 3) nil (4 5 (6)) 7)) ;; (1 2 3 4 5 6 7)

9.4.4. Function: number-sequence from &optional to separation

This function returns a list of numbers starting with FROM and incrementing by SEPARATION, and ending at or just before TO. SEPARATION can be positive or negative and defaults to 1. If TO is ‘nil’ or numerically equal to FROM, the value is the one-element list ‘(FROM)’. If TO is less than FROM with a positive SEPARATION, or greater than FROM with a negative SEPARATION, the value is ‘nil’ because those arguments specify an empty sequence.

If SEPARATION is 0 and TO is neither ‘nil’ nor numerically equal to FROM, ‘number-sequence’ signals an error, since those arguments specify an infinite sequence.

All arguments are numbers. Floating-point arguments can be tricky, because floating-point arithmetic is inexact. For instance, depending on the machine, it may quite well happen that ‘(number-sequence 0.4 0.6 0.2)’ returns the one element list ‘(0.4)’, whereas ‘(number-sequence 0.4 0.8 0.2)’ returns a list with three elements. The Nth element of the list is computed by the exact formula ‘(+ FROM (* N SEPARATION))’. Thus, if one wants to make sure that TO is included in the list, one can pass an expression of this exact type for TO. Alternatively, one can replace TO with a slightly larger value (or a slightly more negative value if SEPARATION is negative).

(number-sequence 4 9) ;; (4 5 6 7 8 9)

9.5. Modifying List Variables

9.5.1. Macro: push element listname

This macro creates a new list whose CAR is ELEMENT and whose CDR is the list specified by LISTNAME, and saves that list in LISTNAME. In the simplest case, LISTNAME is an unquoted symbol naming a list, and this macro is equivalent to ‘(setq LISTNAME (cons ELEMENT LISTNAME))’.

(setq l '(a b)) ;; (a b)
;; inserts to the beginning of the list
(push 'c l) ;; (c a b)
l ;; (c a b)

9.5.2. add-to-list symbol element &optional append compare-fn

This documentation seems to be written by a completely different guy as the tone is priceless.

This function sets the variable SYMBOL by consing ELEMENT onto the old value, if ELEMENT is not already a member of that value. It returns the resulting list, whether updated or not. The value of SYMBOL had better be a list already before the call. ‘add-to-list’ uses COMPARE-FN to compare ELEMENT against existing list members; if COMPARE-FN is ‘nil’, it uses ‘equal’.

Normally, if ELEMENT is added, it is added to the front of SYMBOL, but if the optional argument APPEND is non-‘nil’, it is added at the end.

The argument SYMBOL is not implicitly quoted; ‘add-to-list’ is an ordinary function, like ‘set’ and unlike ‘setq’. Quote the argument yourself if that is what you want.

Do not use this function when SYMBOL refers to a lexical variable.

(setq foo '(a b)) ;; (a b)
(add-to-list 'foo 'c) ;; (c a b)
(add-to-list 'foo 'b) ;; (c a b)
foo ;; (c a b)

9.5.3. add-to-ordered-list symbol element &optional order

This function sets the variable SYMBOL by inserting ELEMENT into the old value, which must be a list, at the position specified by ORDER. If ELEMENT is already a member of the list, its position in the list is adjusted according to ORDER. Membership is tested using ‘eq’. This function returns the resulting list, whether updated or not.

The ORDER is typically a number (integer or float), and the elements of the list are sorted in non-decreasing numerical order.

ORDER may also be omitted or ‘nil’. Then the numeric order of ELEMENT stays unchanged if it already has one; otherwise, ELEMENT has no numeric order. Elements without a numeric list order are placed at the end of the list, in no particular order.

Any other value for ORDER removes the numeric order of ELEMENT if it already has one; otherwise, it is equivalent to ‘nil’.

The argument SYMBOL is not implicitly quoted; ‘add-to-ordered-list’ is an ordinary function, like ‘set’ and unlike ‘setq’. Quote the argument yourself if necessary.

The ordering information is stored in a hash table on SYMBOL’s ‘list-order’ property. SYMBOL cannot refer to a lexical variable.

(setq foo '())
(add-to-ordered-list 'foo 'a 1) ;; (a)
(add-to-ordered-list 'foo 'c 1) ;; (c a)
(add-to-ordered-list 'foo 'b 2) ;; (c a b)
(add-to-ordered-list 'foo 'b 1) ;; (b c a)
foo ;; (b c a)

9.6. Altering List Elements with ‘setcar’

Changing the CAR of a cons cell is done with ‘setcar’. When used on a list, ‘setcar’ replaces one element of a list with a different element.

9.6.1. Function: setcar cons object

This function stores OBJECT as the new CAR of CONS, replacing its previous CAR. In other words, it changes the CAR slot of CONS to refer to OBJECT. It returns the value OBJECT. For example:

(setq x (list 1 2)) ;; (1 2)
(setcar x 4) ;; 4
x ;; (4 2)

When a cons cell is part of the shared structure of several lists, storing a new CAR into the cons changes one element of each of these lists. Here is an example:

;; Create two lists that are partly shared.
(setq x1 (list 'a 'b 'c)) ;; (a b c)
(setq x2 (cons 'z (cdr x1))) ;; (z b c)
(setcar (cdr x1) 'foo) ;; foo
x1 ;; (a foo c)
x2 ;; (z foo c)

;; Replace the CAR of a link that is not shared.
(setcar x1 'baz) ;; baz
x1 ;; (baz foo c)
x2 ;; (z foo c)

Graphical depiction of the code above.

     x1:
      --------------       --------------       --------------
     | car   | cdr  |     | car   | cdr  |     | car   | cdr  |
     |   a   |   o------->|   b   |   o------->|   c   |  nil |
     |       |      |  -->|       |      |     |       |      |
      --------------  |    --------------       --------------
                      |
     x2:              |
      --------------  |
     | car   | cdr  | |
     |   z   |   o----
     |       |      |
      --------------

9.7. Altering the CDR of a List

The lowest-level primitive for modifying a CDR is ‘setcdr’:

9.7.1. Function: setcdr cons object

This function stores OBJECT as the new CDR of CONS, replacing its previous CDR. In other words, it changes the CDR slot of CONS to refer to OBJECT. It returns the value OBJECT.

(setq x (list 1 2 3)) ;; (1 2 3)
(setcdr x '(4)) ;; (4)
x ;; (1 4)

(setcdr x 4) ;; 4
x ;; (1 . 4)

You can delete elements from the middle of a list by altering the CDRs of the cons cells in the list. For example, here we delete the second element, ‘b’, from the list ‘(a b c)’, by changing the CDR of the first cons cell:

(setq x1 (list 'a 'b 'c)) ;; (a b c)
(setcdr x1 (cdr (cdr x1))) ;; (c)
x1 ;; (a c)

Graphical representation of the above:

                        --------------------
                       |                    |
      --------------   |   --------------   |    --------------
     | car   | cdr  |  |  | car   | cdr  |   -->| car   | cdr  |
     |   a   |   o-----   |   b   |   o-------->|   c   |  nil |
     |       |      |     |       |      |      |       |      |
      --------------       --------------        --------------

You can also use this to insert a new element.

(setq x1 (list 'a 'b 'c)) ;; (a b c)
(setcdr x1 `(d ,@(cdr x1))) ;; (d b c)
;; equivalent to the following
;; (setcdr x1 (cons 'd (cdr x1)))
x1 ;; (a d b c)

Graphical representation of the above.

      --------------        -------------       -------------
     | car  | cdr   |      | car  | cdr  |     | car  | cdr  |
     |   a  |   o   |   -->|   b  |   o------->|   c  |  nil |
     |      |   |   |  |   |      |      |     |      |      |
      --------- | --   |    -------------       -------------
                |      |
          -----         --------
         |                      |
         |    ---------------   |
         |   | car   | cdr   |  |
          -->|   d   |   o------
             |       |       |
              ---------------

9.8. Functions that Rearrange Lists

Here are some functions that rearrange lists destructively by modifying the CDRs of their component cons cells. These functions are destructive because they chew up the original lists passed to them as arguments, relinking their cons cells to form a new list that is the returned value.

9.8.1. Function: nconc &rest lists

This function returns a list containing all the elements of LISTS. Unlike ‘append’ (*note 9.4.1), the LISTS are not copied. Instead, the last CDR of each of the LISTS is changed to refer to the following list. The last of the LISTS is not altered.

(setq x (list 1 2 3)) ;; (1 2 3)
(nconc x '(4 5)) ;; (1 2 3 4 5)
x ;; (1 2 3 4 5)

Since the last argument of ‘nconc’ is not itself modified, it is reasonable to use a constant list, such as ‘'(4 5)’, as in the above example. For the same reason, the last argument need not be a list:

(setq x (list 1 2 3)) ;; (1 2 3)
(nconc x 'z) ;; (1 2 3 . z)
x ;; (1 2 3 . z)

However, the other arguments (all but the last) should be mutable lists.

A common pitfall is to use a constant list as a non-last argument to ‘nconc’. If you do this, the resulting behavior is undefined. It is possible that your program will change each time you run it! Here is what might happen (though this is not guaranteed to happen):

(defun add-foo (x)            ; We want this function to add
    (nconc '(foo) x))           ;   ‘foo’ to the front of its arg.
(symbol-function 'add-foo) ;; (lambda (x) (nconc '(foo) x))
(setq xx (add-foo '(1 2))) ;; (foo 1 2)
;; intention is that there will be more 'foo' at the beginning of the list
;; but that is not the case
(setq xx (add-foo '(3 4))) ;; (foo 1 2 3 4)
(setq xx (add-foo '(3 4))) ;; (foo 1 2 3 4 3 4)

9.9. Using Lists as Sets

If you ignore the order of an element in a list, then it becomes a set, though this list could have duplicate elements.

9.9.1. Function: memq object list

Check if OBJECT is a member of LIST by comparing the OBJECT and element of LIST are the same Lisp object. If it is, ‘memq’ returns a list starting with the first occurrence of OBJECT. Otherwise, it returns ‘nil’. The letter ‘q’ in ‘memq’ says that it uses ‘eq’ to compare OBJECT against the elements of the list.

(memq 'b '(a b c b a)) ;; (b c b a)
(setq test '(1 2))
(memq '(1 2) '(1 test)) ;; nil

9.9.2. Function: delq object list

This function destructively removes all elements ‘eq’ to OBJECT from LIST, and returns the resulting list. The letter ‘q’ in ‘delq’ says that it uses ‘eq’ to compare OBJECT against the elements of the list, like ‘memq’ and ‘remq’.

Typically, when you invoke ‘delq’, you should use the return value by assigning it to the variable which held the original list. The reason for this is explained below.

The ‘delq’ function deletes elements from the front of the list by simply advancing down the list, and returning a sublist that starts after those elements. For example:

(delq 'a '(a b c)) ≡ (cdr '(a b c))

(delq 'a '(a b c)) ;; (b c)
(cdr '(a b c)) ;; (b c)

When an element to be deleted appears in the middle of the list, removing it involves changing the CDRs (*note 9.7.1).

(setq sample-list (list 'a 'b 'c '(4))) ;; (a b c (4))
(delq 'a sample-list) ;; (b c (4))
sample-list ;; (a b c (4))
(delq 'c sample-list) ;; (a b (4))
sample-list ;; (a b (4))

Note that ‘(delq 'c sample-list)’ modifies ‘sample-list’ to splice out the third element, but ‘(delq 'a sample-list)’ does not splice anything—it just returns a shorter list. Don’t assume that a variable which formerly held the argument LIST now has fewer elements, or that it still holds the original list! Instead, save the result of ‘delq’ and use that. Most often we store the result back into the variable that held the original list:

(setq flowers '(rose sunflower)) ;; (rose sunflower)
(setq flowers (delq 'rose flowers)) ;; (sunflower)
flowers ;; (sunflower)

In the following example, the ‘(list 4)’ that ‘delq’ attempts to match and the ‘(4)’ in the ‘sample-list’ are ‘equal’ but not ‘eq’:

(delq (list 4) sample-list) ;; (a b (4))
sample-list ;; (a b (4))

9.10. Association Lists

Basically a dictionary in python.

Mapping from keys to values.

It is a list of cons cells called “associations”: the CAR of each cons cell is the “key”, and the CDR is the “associated value”.

Here is an example of an alist. The key ‘pine’ is associated with the value ‘cones’; the key ‘oak’ is associated with ‘acorns’; and the key ‘maple’ is associated with ‘seeds’.

((pine . cones)
 (oak . acorns)
 (maple . seeds))

Both the values and the keys in an alist may be any Lisp objects.* For example, in the following alist, the symbol ‘a’ is associated with the number ‘1’, and the string ‘"b"’ is associated with the list ‘(2 3)’, which is the CDR of the alist element:

((a . 1) ("b" 2 3))

Sometimes it is better to design an alist to store the associated value in the CAR of the CDR of the element. Here is an example of such an alist:

;; red is in the CAR of the CDR of (rose red)
((rose red) (lily white) (buttercup yellow))

Here we regard ‘red’ as the value associated with ‘rose’. One advantage of this kind of alist is that you can store other related information—even a list of other items—in the CDR of the CDR. One disadvantage is that you cannot use ‘rassq’ (see below) to find the element containing a given value. When neither of these considerations is important, the choice is a matter of taste, as long as you are consistent about it for any given alist.

The same alist shown above could be regarded as having the associated value in the CDR of the element; the value associated with ‘rose’ would be the list ‘(red)’.

Association lists are often used to record information that you might otherwise keep on a stack, since new associations may be added easily to the front of the list. When searching an association list for an association with a given key, the first one found is returned, if there is more than one.

In Emacs Lisp, it is not an error if an element of an association list is not a cons cell. The alist search functions simply ignore such elements. Many other versions of Lisp signal errors in such cases.

Note that 9.11 are similar to association lists in several respects. A property list behaves like an association list in which each key can occur only once.

9.10.1. Function: assoc key alist &optional testfn

This function returns the first association for KEY in ALIST, comparing KEY against the alist elements using TESTFN if it is non-‘nil’ and ‘equal’ otherwise (see 5). It returns ‘nil’ if no association in ALIST has a CAR equal to KEY. For example:

(setq trees '((pine . cones) (oak . acorns) (maple . seeds)))

(assoc 'oak trees)

(cdr (assoc 'oak trees))

(assoc 'birch trees)

Here is another example, in which the keys and values are not symbols:

(setq needles-per-cluster
    '((2 "Austrian Pine" "Red Pine")
        (3 "Pitch Pine")
        (5 "White Pine")))

(cdr (assoc 3 needles-per-cluster))
(cdr (assoc 2 needles-per-cluster))

The function ‘assoc-string’ is much like ‘assoc’ except that it ignores certain differences between strings. *Note Text Comparison::.

9.10.2. Function: rassoc value alist

Search value of the mapping. r in the function name means "reverse".

This function returns the first association with value VALUE in ALIST. It returns ‘nil’ if no association in ALIST has a CDR ‘equal’ to VALUE.

(setq needles-per-cluster
    '((2 "Austrian Pine" "Red Pine")
        (3 "Pitch Pine")
        (5 "White Pine")))

(rassoc "Austrian Pine" needles-per-cluster) ;; nil
(rassoc "White Pine" needles-per-cluster) ;; nil
(rassoc '("White Pine") needles-per-cluster) ;; (5 "White Pine")

9.10.3. Function: assq key alist

This function is like ‘assoc’ in that it returns the first association for KEY in ALIST, but it makes the comparison using ‘eq’. ‘assq’ returns ‘nil’ if no association in ALIST has a CAR ‘eq’ to KEY. This function is used more often than ‘assoc’, since ‘eq’ is faster than ‘equal’ and most alists use symbols as keys.

(setq trees '((pine . cones) (oak . acorns) (maple . seeds)))
(assq 'pine trees)

On the other hand, ‘assq’ is not usually useful in alists where the keys may not be symbols:

(setq leaves
    '(("simple leaves" . oak)
        ("compound leaves" . horsechestnut)))
;; this is affected by eq's behaviour
(assq "simple leaves" leaves) ;; nil or ("simple leaves" . oak)
(assoc "simple leaves" leaves) ;; ("simple leaves" . oak)

9.10.4. Function: alist-get key alist &optional default remove testfn

This function is similar to ‘assq’. It finds the first association ‘(KEY . VALUE)’ by comparing KEY with ALIST elements, and, if found, returns the VALUE of that association. If no association is found, the function returns DEFAULT. Comparison of KEY against ALIST elements uses the function specified by TESTFN, defaulting to ‘eq’.

This is a generalized variable (*note Generalized Variables::) that can be used to change a value with ‘setf’. When using it to set a value, optional argument REMOVE non-‘nil’ means to remove KEY’s association from ALIST if the new value is ‘eql’ to DEFAULT.

9.10.5. Function: rassq value alist

This function returns the first association with value VALUE in ALIST. It returns ‘nil’ if no association in ALIST has a CDR ‘eq’ to VALUE.

‘rassq’ is like ‘assq’ except that it compares the CDR of each ALIST association instead of the CAR. You can think of this as reverse ‘assq’, finding the key for a given value.

(setq trees '((pine . cones) (oak . acorns) (maple . seeds)))

(rassq 'acorns trees)
(rassq 'spores trees)

‘rassq’ cannot search for a value stored in the CAR of the CDR of an element:

(setq colors '((rose red) (lily white) (buttercup yellow)))

;; In this case, the CDR of the association ‘(lily white)’ is not the
;; symbol ‘white’, but rather the list ‘(white)’.  This becomes
;; clearer if the association is written in dotted pair notation:
(rassq 'white colors) ;; (lily . whitelist)

(setq white "white color")
(setq whitelist '(white))
(setq colors '((rose red) (lily . whitelist) (buttercup yellow)))
;; this works because the eq function receives the same whitelist object.
(rassq 'whitelist colors) ;; (lily . whitelist)

9.10.6. Function: assoc-default key alist &optional test default

This function searches ALIST for a match for KEY. For each element of ALIST, it compares the element (if it is an atom) or the element’s CAR (if it is a cons) against KEY, by calling TEST with two arguments: the element or its CAR, and KEY. The arguments are passed in that order so that you can get useful results using ‘string-match’ with an alist that contains regular expressions (*note Regexp Search::). If TEST is omitted or ‘nil’, ‘equal’ is used for comparison.

If an alist element matches KEY by this criterion, then ‘assoc-default’ returns a value based on this element. If the element is a cons, then the value is the element’s CDR. Otherwise, the return value is DEFAULT.

If no alist element matches KEY, ‘assoc-default’ returns ‘nil’.

9.10.7. Function: copy-alist alist

This function returns a two-level deep copy of ALIST: it creates a new copy of each association, so that you can alter the associations of the new alist without changing the old one.

(setq needles-per-cluster
      '((2 . ("Austrian Pine" "Red Pine"))
        (3 . ("Pitch Pine"))
        (5 . ("White Pine"))))
(setq copy (copy-alist needles-per-cluster))

;; association lists are different
(eq needles-per-cluster copy) ;; nil
(equal needles-per-cluster copy) ;; t

;; assocations are different
(eq (car needles-per-cluster) (car copy)) ;; nil
(equal (car needles-per-cluster) (car copy)) ;; t

;; contents in the cons cells are the same
(eq (car (car needles-per-cluster)) (car (car copy))) ;; t

This example shows how ‘copy-alist’ makes it possible to change the associations of one copy without affecting the other:

(setcdr (assq 3 copy) '("Martian Vacuum Pine"))
(cdr (assq 3 needles-per-cluster))

9.10.8. Function: assq-delete-all key alist

This function deletes from ALIST all the elements whose CAR is ‘eq’ to KEY, much as if you used ‘delq’ to delete each such element one by one. It returns the shortened alist, and often modifies the original list structure of ALIST. For correct results, use the return value of ‘assq-delete-all’ rather than looking at the saved value of ALIST.

(setq alist (list '(foo 1) '(bar 2) '(foo 3) '(lose 4)))

(assq-delete-all 'foo alist) ;; ((bar 2) (lose 4))
alist ;; ((foo 1) (bar 2) (lose 4))

9.10.9. Function: assoc-delete-all key alist &optional test

This function is like ‘assq-delete-all’ except that it accepts an optional argument TEST, a predicate function to compare the keys in ALIST. If omitted or ‘nil’, TEST defaults to ‘equal’. As ‘assq-delete-all’, this function often modifies the original list structure of ALIST.

9.10.10. Function: rassq-delete-all value alist

This function deletes from ALIST all the elements whose CDR is ‘eq’ to VALUE. It returns the shortened alist, and often modifies the original list structure of ALIST. ‘rassq-delete-all’ is like ‘assq-delete-all’ except that it compares the CDR of each ALIST association instead of the CAR.

9.10.11. Macro: let-alist alist body

Looks like accessing the value with keys that are prefixed with a dot.

seems to have a typo.

Creates a binding for each symbol used as keys the association list

Creates a binding for each symbol used as keys in the association list ALIST, prefixed with dot. This can be useful when accessing several items in the same association list, and it’s best understood through a simple example:

(setq colors '((rose . red) (lily . white) (buttercup . yellow)))

(let-alist colors
  (if (eq .rose 'red)
      .lily)) ;; white

The BODY is inspected at compilation time, and only the symbols that appear in BODY with a ‘.’ as the first character in the symbol name will be bound. Finding the keys is done with ‘assq’, and the ‘cdr’ of the return value of this ‘assq’ is assigned as the value for the binding.

Nested association lists is supported:

(setq colors '((rose . red) (lily (belladonna . yellow) (brindisi . pink))))

(let-alist colors
  (if (eq .rose 'red)
      .lily.belladonna)) ;; yellow

Nesting ‘let-alist’ inside each other is allowed, but the code in the inner ‘let-alist’ can’t access the variables bound by the outer ‘let-alist’.

9.11. Property Lists

10. log time

Table 1: Clock summary at [2022-04-10 Sun 22:23]
Headline	Time
Total time	7:11
Intro	0:06
log time	7:05