Software Open Access

detect-delimiter software project

McNamara, Timothy S


Dublin Core Export

<?xml version='1.0' encoding='utf-8'?>
<oai_dc:dc xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd">
  <dc:creator>McNamara, Timothy S</dc:creator>
  <dc:date>2018-07-29</dc:date>
  <dc:description>Data sharing and research collaboration is hindered when messy data is released. Despite this, it still exists. detect-delimiter aims to provide tools with an option for inspecting the header of a text file and checking the actual delimiter used, rather than blindly believing the file extension.

Usage

The detect_delimiter module exposes the detect() functinon, which takes a str as input and returns a delimiter.

&gt;&gt;&gt; from detect_delimiter import detect
&gt;&gt;&gt; detect("looks|like|the vertical bar\n is|the|delimiter\n")
'|'

When detect() doesn't know, it returns None:

&gt;&gt;&gt; text = "not really any delimiters in here.\nthis is just text.\n"
&gt;&gt;&gt; detect()

It's possible to provide a default, which will be used in that case:

&gt;&gt;&gt; detect(text, default=',')
','


By default, detect() will prevent avoid checking alpha-numeric characters and the period/full stop character ("."). This can be adjusted via the blacklist parameter.

If you believe that you know the delimiter, it's possible to provide a list of possible delimiters to check for via the whitelist parameter. If you don't provide a value, [',', ';', ':', '|', '\t'] will be checked.</dc:description>
  <dc:description>This Python package can also be installed via "pip": "pip install detect-delimiter".</dc:description>
  <dc:identifier>https://zenodo.org/record/1323186</dc:identifier>
  <dc:identifier>10.5281/zenodo.1323186</dc:identifier>
  <dc:identifier>oai:zenodo.org:1323186</dc:identifier>
  <dc:language>eng</dc:language>
  <dc:relation>doi:10.5281/zenodo.1323185</dc:relation>
  <dc:rights>info:eu-repo/semantics/openAccess</dc:rights>
  <dc:rights>https://creativecommons.org/licenses/by/4.0/legalcode</dc:rights>
  <dc:subject>text processing</dc:subject>
  <dc:subject>Python programming</dc:subject>
  <dc:title>detect-delimiter software project</dc:title>
  <dc:type>info:eu-repo/semantics/other</dc:type>
  <dc:type>software</dc:type>
</oai_dc:dc>
112
23
views
downloads
All versions This version
Views 112112
Downloads 2323
Data volume 71.3 kB71.3 kB
Unique views 101101
Unique downloads 2222

Share

Cite as