Published April 29, 2018 | Version 1.1.0
Software Open

casics/spiral: Version 1.1.0

Creators

  • 1. California Institute of Technology

Description

Version 1.1.0 fixes a bug that prevented importing Spiral, and another bug that caused setup.py to fail to install dependencies automatically. Additional enhancements include improved command-line help and internal code refactoring.

Spiral is a Python module that provides several functions for splitting identifiers found in source code files. Identifier splitting (also known as identifier name tokenization) is the task of breaking apart program identifier strings such as getInt or readUTF8stream into their component tokens: [get, int] and [read, utf8, stream], respectively. The need for splitting identifiers arises in a variety of contexts, including natural language processing (NLP) methods applied to source code analysis and program comprehension. Spiral provides some basic, naive splitting algorithms, such as a straightforward camel-case splitter, as well as more elaborate heuristic splitters, such as an algorithm we call Ronin.
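
To make the task concrete, here is a minimal sketch of a naive camel-case splitter. It is not Spiral's code or API: the function name naive_camel_split and its rules (split on underscores and on lowercase-or-digit to uppercase transitions, then lowercase the tokens) are assumptions chosen for illustration only.

```python
import re


def naive_camel_split(identifier):
    """Split an identifier on underscores and lower-to-upper case transitions.

    Illustrative only; this is a hypothetical helper, not part of Spiral.
    """
    tokens = []
    for chunk in identifier.split("_"):
        if not chunk:
            continue  # skip empty pieces from leading or doubled underscores
        # Insert a space wherever a lowercase letter or digit is followed
        # by an uppercase letter, then split on whitespace.
        spaced = re.sub(r"(?<=[a-z0-9])(?=[A-Z])", " ", chunk)
        tokens.extend(spaced.split())
    return [token.lower() for token in tokens]


print(naive_camel_split("getInt"))          # ['get', 'int']
print(naive_camel_split("readUTF8stream"))  # ['read', 'utf8stream']
```

The second call shows the limitation of this kind of rule: with no case change between "UTF8" and "stream", the sketch returns utf8stream as a single token instead of [utf8, stream]. Cases like this are where heuristic splitters are needed.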

The name Spiral is a loose acronym based on "SPlitters for IdentifieRs: A Library".

Files

casics/spiral-1.1.0.zip (19.3 MB)
md5:9ac2ad3caa294680c0008982ab18f5d6

Additional details

Related works