{
  "DOI": "10.5281/zenodo.1168703",
  "abstract": "PAM\nPAM (Parallel Augmented Maps) is a parallel C++ library implementing the interface for augmented maps [1].  It is designed for maintaining an ordered map data structure while efficiently answering range-based and related queries.    In the experiments we use the interface in four examples: augmented-sums, interval-queries, 2d range-queries, and an inverted index.    The released code includes the examples and scripts for running the specific experiments reported in the paper.   It is also designed so it is easy to try in many other scenarios (different sizes, different numbers of cores, and other operations described in the paper, but not reported in the experiments).\n\nMore details and examples can be found in our paper [1].\nUsage:\n\nTo define an augmented map using PAM, user need to specify the parameters including type names and (static) functions in the entry structure ``entry''.\n\n\n\ntypename key_t: the key type (K),\n\nfunction comp: K x K -> bool: the comparison function on K (<_K)\n\ntypename val_t: the value type (V),\n\ntypename aug_t: the augmented value type (A),\n\nfunction from_entry: K x V -> A: the base function (g)\n\nfunction combine: A x A -> A: the combine function (f)\n\nfunction get_empty: empty -> A: the identity of f (I)\n\n\nThen an augmented map is defined with C++ template as\n\naugmented_map<entry>.\n\n\nNote that a plain ordered map is defined as an augmented map with no augmentation (i.e., it only has K, <_K and V in its entry) and a plain ordered set is similarly defined as an augmented map with no augmentation and no value type.\n\nHere is an example of defining an augmented map \"m\" that has integer keys and values and is augmented with value sums (similar as the augmented sum example in our paper [1]):\n\nstruct entry {\n  using key_t = int;\n  using val_t = int;\n  using aug_t = int;\n  static bool comp(key_t a, key_t b) { \n    return a < b;}\n  static aug_t get_empty() { return 0;}\n  static aug_t from_entry(key_t k, val_t v) { \n    return v;}\n  static aug_t combine(aug_t a, aug_t b) { \n    return a+b;}};\naugmented_map<entry> m;\n\n\nAnother quick example can be found in [1], which shows how to implement an interval tree using the PAM interface.\nHardware dependencies\n\nAny modern (2010+) x86-based multicore machines.  Relies on 128-bit CMPXCHG (requires -mcx16 compiler flag) but does not need hardware transactional memory (TSX).  Most examples given in our scripts require 64GB memory, but range_query requires 256GB memory and aug_sum on the large input requires 1TB memory.  All the examples can take smaller input size by setting command line arguments.\nSoftware dependencies\n\nPAM requires g++ 5.4.0 or later versions supporting the Cilk Plus extensions.    The scripts that we provide in the repository use \"numactl\" for better performance. All tests can also run directly without \"numactl\".\n\nWe use python to write a script to organize all results and compute speedup. It is not required to run tests.\nDatasets\n\nWe use the publicly available Wikipedia database (dumped on Oct. 1, 2016) for the inverted index experiment.  We release a sample (1% of total size) in the github repository (35MB compressed).  The full data (3.5TB compressed) is available on request.  All other applications use randomly generated data.\nExperiment Workflow:\n\nAt the top level there is a makefile (make) and a script for compiling and running all timings (./run_all).  The source code\nof the library is provided in the directory c++/, and the other directories\neach corresponds to some examples of applications. There are\nfour example applications provided in our repository:\n\n\n\nThe range sum (in directory aug_sum/).\n\nThe interval tree (in directory interval/).\n\nThe range tree (in directory range_query/).\n\nThe inverted indices (in directory index/).\n\n\nIn each of the directories there is a separated makefile and a script to run the timings for the corresponding application.\n\nAll tests include parallel and sequential running times.  The sequential versions are the algorithms running directly on one thread, and the parallel versions use all threads on the machine using \"numactl -i all\".\n\nTo run all tests, just type the following at the top level:\n\nmake\n./run_all.sh\n\n\nBy default the script will not include the tests on very large input sizes (10 billion), which costs long. Users can use\n\nmake\n./run_all.sh -l\n\n\nto include large tests.\n\nTo run separated tests on each application, users can also go to each sub-directory to run the scripts.\n\nWe recommend to use numactl -i all on all parallel tests.\nAugmented Sum (/aug_sum/)\n\nUsing\n\nmake\n\n\nwill give two executable files aug_sum (augmented version) and aug_sumNA (non-augmented version). This is done by setting flag -DNO_AUG in compiling time.\n\nUsing\n\n./run_aug_sum.sh -l\n\n\nwill run all experiments as shown in Table 3 in [1] on augmented sum. If you do not want to run on large input, remove \"-l\".\n\nUsing\n\n./runall [-r rounds] [-p threads]\n\n\nwill run all functions as shown in Table 3 in [1] with 'rounds' rounds and 'threads' threads. By default rounds=3 and threads=nproc --all (maximum number of threads on the machine).\n\nBoth scripts will output to both stdout and a file res.txt. The script run_aug_sum.sh will then call a python code to give all results (timings and speedups) in a file data.txt.\n\nIf user wants to directly run our executable file (aug_sum or aug_sumNA), the arguments are listed as follows:\n\n./aug_sum [-n size1] [-m size2] [-r rounds] [-p] <testid>\n\nInterval Tree (/interval/)\n\nUsing\n\nmake\n\n\nwill give the executable file (interval).\n\nUsing\n\nrun_interval\n\n\nwill give the same experiment of interval trees as shown in Table 5 in [1].\n\nTo directly run the executable file (interval), one can try:\n\n./interval n q r\n\n\nwhere n stands for the number of intervals, q is the number of querys, r is the number of rounds. By default n=100000000, q=n, r=5.\nRange Tree (/range_query/)\n\nUsing\n\nmake\n\n\nwill give the executable file (rt_test).\n\nUsing\n\n./run_range\n\n\nwill give the same experiment of range trees as shown in Table 5 in [1].\n\nTo directly run the executable file (rt_test), one can try:\n\n./rt_test [-n size] [-l rmin] [-h rmax] [-r rounds] [-q queries] [-w window] [-t query_type]\n\n\nwhere 'size' stands for the number of points, 'rmin' and 'rmax' are the upper and lower bound of coordinates, 'rounds' is the number of rounds, 'queries' is the number of queries, 'window' is the query window size (for one dimension), 'query_type' is 0 for query-all, and 1 for query-ssum. By default n=100000000, l=0, h=1000000000, r=3, q=1000, w=1000000, t=0.\nInverted Index\n\nUsing\n\nmake\n\n\nwill give the executable file (index).\n\nUsing\n\n./run_index\n\n\nwill give the same type of experiment of inverted index as shown in Table 6 in [1], but on a smaller input size.\n\nTo directly run the executable file (index), one can try:\n\n./index [-o] [-v] [-n max_chars] [-q num_queries] [-f file]\n\n\nwhere '-o' indicates an output file of query results, '-v' means to output verbose information, '-n' means the length to read from a file, '-q' is the number of queries, and '-f' is the input file. By default n=1000000000000 (just read the whole file), q=10000, f='wiki_small.txt'.\nReference\n\n[1] Yihan Sun, Daniel Ferizovic, and Guy E. Blelloch. PAM: Parallel Augmented Maps. PPoPP 2018.",
  "author": [
    {
      "family": "syhlalala"
    }
  ],
  "id": "1168703",
  "issued": {
    "date-parts": [
      [
        "2018",
        "02",
        "07"
      ]
    ]
  },
  "publisher": "Zenodo",
  "title": "syhlalala/PAM-AE: PAM",
  "type": "software",
  "version": "v1.0"
}