PREPARATION OF UNITED STATES’ STATE COURT CASES DATASET AND DERIVING AN EFFICIENT EMBEDDING FOR CASES

Ravi, Karthik

PREPARATION OF UNITED STATES’ STATE COURT CASES DATASET AND DERIVING AN EFFICIENT EMBEDDING FOR CASES

Ravi, Karthik

2019

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Add to Basket

Files

Abstract

Data mining techniques gained acceptance as a clear means of finding information in data. In the past, these techniques have been effectively applied to discover patterns, find correlations, extract information from unstructured data. Often overlooked is the major fact that data mining techniques were effectively applied only on scientific datasets like finance, healthcare, physics, and chemistry. We identified a study on the U.S. State Supreme Courts is significantly constrained by the lack of available data. To find an antidote to this problem, I have conducted research to produce an original dataset for every state supreme court ruling from 1953 through 2014. We have utilized dynamic textual analysis to search through the case files of thousands of state supreme court decisions and extract critical information on each case. We present trends analysis on the case distribution across states, the month of submitting the case, regional reporter, and legal issues being heard in front of the court. Following the synthesis of the dataset, we prepared a vector representation of the cases having similar characteristics based on the text in the overview section of the case using neural network architecture. Meanwhile, we used a generative statistical model to cluster the 2.1 Million cases extracted into 17 bins based on the word presence the case text. A validation study conducted on researchers in political science proves that all the three datasets 1. Original Supreme Court Dataset 2. Vector Representation of CaseSummary 3. Topic Clusters of the Cases Summary are highly structured, extremely meaningful. Hence, these datasets will offer scholars enormous possibilities to expand the knowledge of judicial politics in the American States.

Details

Title

PREPARATION OF UNITED STATES’ STATE COURT CASES DATASET AND DERIVING AN EFFICIENT EMBEDDING FOR CASES

Author

Ravi, Karthik (Computer Science)

Contributor

ProQuest (Firm) Contributor
University of North Carolina at Charlotte Degree Granting Institution
Shaikh, Samira Thesis Advisor
Jake, Minwoo Committee Member
Windett, Jason Committee Member

Date

2019

Publisher

University of North Carolina at Charlotte

Subjects

Computer science
Political science

Keywords

Cases; Deep Learning; Natural Language Processing; Politcal Science; Supreme Court; Text Mining

Link to This Page

Handle: http://hdl.handle.net/20.500.13093/etd:2205

Publication Type

masters theses

Pagination

1 online resource (66 pages) : PDF

File Format

application/pdf

Degree Type

M.S.

Usage Statement

This Item is protected by copyright and/or related rights. You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s). For additional information, see http://rightsstatements.org/page/InC/1.0/., (http://rightsstatements.org/page/InC/1.0/)
Copyright is held by the author unless otherwise indicated.

Record Appears in

Departments and Institutes > Computer Science
Types > Masters Theses
Graduate Theses and Dissertations
Graduate Thesis and Dissertations

PDF

Statistics

Download Full History

PREPARATION OF UNITED STATES’ STATE COURT CASES DATASET AND DERIVING AN EFFICIENT EMBEDDING FOR CASES

Files

Abstract

Details

Related Items

PDF

Statistics