{ "access": { "embargo": { "active": false, "reason": null }, "files": "public", "record": "public", "status": "open" }, "created": "2022-07-20T22:08:07.991832+00:00", "custom_fields": { "meeting:meeting": { "dates": "18-21 July 2022", "place": "Porto, Portugal", "title": "4th European Conference of the International Association for Forensic and Legal Linguists", "url": "https://iafllporto2022.wordpress.com/" } }, "deletion_status": { "is_deleted": false, "status": "P" }, "files": { "count": 1, "enabled": true, "entries": { "Using likelihood ratio framework.pdf": { "checksum": "md5:442583ae010eb2b777d09dde91504ac8", "ext": "pdf", "id": "119abc4d-5adc-4c67-aaea-e5ac7461d3fd", "key": "Using likelihood ratio framework.pdf", "metadata": null, "mimetype": "application/pdf", "size": 5938648 } }, "order": [], "total_bytes": 5938648 }, "id": "6872155", "is_draft": false, "is_published": true, "links": { "access": "https://zenodo.org/api/records/6872155/access", "access_links": "https://zenodo.org/api/records/6872155/access/links", "access_request": "https://zenodo.org/api/records/6872155/access/request", "access_users": "https://zenodo.org/api/records/6872155/access/users", "archive": "https://zenodo.org/api/records/6872155/files-archive", "archive_media": "https://zenodo.org/api/records/6872155/media-files-archive", "communities": "https://zenodo.org/api/records/6872155/communities", "communities-suggestions": "https://zenodo.org/api/records/6872155/communities-suggestions", "doi": "https://doi.org/10.5281/zenodo.6872155", "draft": "https://zenodo.org/api/records/6872155/draft", "files": "https://zenodo.org/api/records/6872155/files", "latest": "https://zenodo.org/api/records/6872155/versions/latest", "latest_html": "https://zenodo.org/records/6872155/latest", "media_files": "https://zenodo.org/api/records/6872155/media-files", "parent": "https://zenodo.org/api/records/6872154", "parent_doi": "https://zenodo.org/doi/10.5281/zenodo.6872154", "parent_html": "https://zenodo.org/records/6872154", "requests": "https://zenodo.org/api/records/6872155/requests", "reserve_doi": "https://zenodo.org/api/records/6872155/draft/pids/doi", "self": "https://zenodo.org/api/records/6872155", "self_doi": "https://zenodo.org/doi/10.5281/zenodo.6872155", "self_html": "https://zenodo.org/records/6872155", "self_iiif_manifest": "https://zenodo.org/api/iiif/record:6872155/manifest", "self_iiif_sequence": "https://zenodo.org/api/iiif/record:6872155/sequence/default", "versions": "https://zenodo.org/api/records/6872155/versions" }, "media_files": { "count": 0, "enabled": false, "entries": {}, "order": [], "total_bytes": 0 }, "metadata": { "creators": [ { "affiliations": [ { "name": "University of Manchester" } ], "person_or_org": { "family_name": "Andrea Nini", "identifiers": [ { "identifier": "0000-0003-4218-5130", "scheme": "orcid" } ], "name": "Andrea Nini", "type": "personal" } } ], "description": "
The likelihood ratio framework is an ideal way for an expert witness to present their evidence in court because it reflects their duty of expressing the strength of evidence in favour of a certain hypothesis (Morrison, 2009). Recent research by Ishihara (2021) demonstrates how this approach can be applied to authorship identification. In this talk I will describe the application of this framework to a real-life authorship identification case involving text messages. The method adopted is a special type of the General Impostors method, the state-of-the-art method for authorship verification problems (Koppel and Winter, 2014). The drawback of this method as applied in computer science is that it is dependent on dynamic feature sets, such as character 4-grams. These features are difficult to interpret and sensitive to topic and register variation. Instead, I will show how a manually curated static feature set, similar to a writeprint (Abbasi and Chen, 2008), can lead to equally excellent performance while limiting the capturing of confounding information. I will conclude by arguing that the move to the likelihood ratio framework for forensic authorship identification is not a goal in the distant future but a reality that should be adopted now.
\n\n\n\n
References
\n\nAbbasi, A. and Chen, H. (2008) Writeprints\u202f: A stylometric approach to identity-level identification and similarity detection in cyberspace, In ACM Transactions on Information Systems, New York, NY, USA.
\n\nIshihara, S. (2021) Score-based likelihood ratios for linguistic text evidence with a bag-of-words model, Forensic Science International, Elsevier, 327, p. 110980.
\n\nKoppel, M. and Winter, Y. (2014) Determining if two documents are written by the same author, Journal of the Association for Information Science and Technology, 65(1), pp. 178–187.
\n\nMorrison, G. S. (2009) Forensic voice comparison and the paradigm shift, Science and Justice, 49(4), pp. 298–308.
", "publication_date": "2022-07-20", "publisher": "Zenodo", "resource_type": { "id": "presentation", "title": { "de": "Pr\u00e4sentation", "en": "Presentation" } }, "rights": [ { "description": { "en": "The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited." }, "icon": "cc-by-icon", "id": "cc-by-4.0", "props": { "scheme": "spdx", "url": "https://creativecommons.org/licenses/by/4.0/legalcode" }, "title": { "en": "Creative Commons Attribution 4.0 International" } } ], "subjects": [ { "subject": "forensic linguistics" }, { "subject": "forensic science" }, { "subject": "authorship analysis" } ], "title": "Using the likelihood ratio framework in real authorship identification casework: The General Impostors with Writeprints method" }, "parent": { "access": { "owned_by": { "user": 100318 } }, "communities": {}, "id": "6872154", "pids": { "doi": { "client": "datacite", "identifier": "10.5281/zenodo.6872154", "provider": "datacite" } } }, "pids": { "doi": { "client": "datacite", "identifier": "10.5281/zenodo.6872155", "provider": "datacite" }, "oai": { "identifier": "oai:zenodo.org:6872155", "provider": "oai" } }, "revision_id": 2, "stats": { "all_versions": { "data_volume": 498846432.0, "downloads": 84, "unique_downloads": 73, "unique_views": 137, "views": 144 }, "this_version": { "data_volume": 492907784.0, "downloads": 83, "unique_downloads": 72, "unique_views": 135, "views": 142 } }, "status": "published", "updated": "2022-07-21T01:49:31.137553+00:00", "versions": { "index": 1, "is_latest": true } }