We consider several fundamental principles of rabbinic approaches to handling uncertainty for legal purposes. We find that non-numerical versions of ideas subsequently developed in the literature on ...interpretations of probabilistic statements are useful for explicating these rabbinic principles.
Almost any conceivable authorship attribution problem can be reduced to one fundamental problem: whether a pair of (possibly short) documents were written by the same author. In this article, we ...offer an (almost) unsupervised method for solving this problem with surprisingly high accuracy. The main idea is to use repeated feature subsampling methods to determine if one document of the pair allows us to select the other from among a background set of “impostors” in a sufficiently robust manner.
We consider a thought experiment in which voters could submit binary preferences regarding each of a pre-determined list of independent relevant issues, so that majorities could be tallied per issue. ...It might be thought that if such voting became technically feasible and widespread, parties and coalitions could be circumvented altogether and would become irrelevant. In this paper, we show, however, why and how voters would spontaneously self-organize into parties, and parties would self-organize into coalitions, prior to elections. We will see that such coordination is possible, even assuming very limited capabilities of communication and coordination. Using both analytical and empirical methods, we show that the average voter in a majority coalition would gain more than if no parties were formed, but the average voter overall (in or out of the coalition) would be worse off. Furthermore, the extent of these gains and losses is inversely proportional to the degree to which voters line along a unidimensional left–right axis.
The measurement of disproportionality, volatility and malapportionment often employ similar indices. Yet the debate on the issue of adequate measurement has remained open. We offer a formal and ...rigorous list of properties that roughly subsume those of Taagepera and Grofman (Party Polit 9(6): 659-677,2003). One of these properties, Dalton's principle of transfers, is formalized in a manner that resolves the ambiguity associated with it in previous studies. We show that the cosine measure satisfies all the properties. We also show how the Gallagher index can be modified to satisfy all the properties. The cosine measure and the modified Gallagher measure are co-monotone.
Authorship attribution in the wild Koppel, Moshe; Schler, Jonathan; Argamon, Shlomo
Language Resources and Evaluation,
03/2011, Letnik:
45, Številka:
1
Journal Article
Recenzirano
Most previous work on authorship attribution has focused on the case in which we need to attribute an anonymous document to one of a small set of candidate authors. In this paper, we consider ...authorship attribution as found in the wild: the set of known candidates is extremely large (possibly many thousands) and might not even include the actual author. Moreover, the known texts and the anonymous texts might be of limited length. We show that even in these difficult cases, we can use similarity-based methods along with multiple randomized feature sets to achieve high precision. Moreover, we show the precise relationship between attribution precision and four parameters: the size of the candidate set, the quantity of known-text by the candidates, the length of the anonymous text and a certain robustness score associated with a attribution.
•We shed new light on the authenticity of the writings of Julius Caesar.•Hirtius, one of Caesar’s generals, must have contributed to Caesar’s writings.•We benchmark two authorship verification ...systems on publicly available data sets.•We test on both modern data sets, and Latin texts from Antiquity.•We show how computational methods inform traditional authentication studies.
In this paper, we shed new light on the authenticity of the Corpus Caesarianum, a group of five commentaries describing the campaigns of Julius Caesar (100–44 BC), the founder of the Roman empire. While Caesar himself has authored at least part of these commentaries, the authorship of the rest of the texts remains a puzzle that has persisted for nineteen centuries. In particular, the role of Caesar’s general Aulus Hirtius, who has claimed a role in shaping the corpus, has remained in contention. Determining the authorship of documents is an increasingly important authentication problem in information and computer science, with valuable applications, ranging from the domain of art history to counter-terrorism research. We describe two state-of-the-art authorship verification systems and benchmark them on 6 present-day evaluation corpora, as well as a Latin benchmark dataset. Regarding Caesar’s writings, our analyses allow us to establish that Hirtius’s claims to part of the corpus must be considered legitimate. We thus demonstrate how computational methods constitute a valuable methodological complement to traditional, expert-based approaches to document authentication.
...if a piece of meat is found on the street ("parish"), but can be presumed to have originated in one of the ten shops, we regard it as having originated in one of the kosher shops, since they ...constitute a majority ("kol de-parish me-ruba parish"). ...we regard the probability that the meat is kosher as 50%. ...membership in the set is determined at a particular moment - specifically, the moment immediately prior to the piece of meat being specifi ed as an object the status of which we wish to determine ("leidat ha-safek") - and the choice of this specific moment as being critical is not an obvious one. ...a plain reading of the text of the gemara leaves no room for Kazhdan's explanation of kavu'a and even the attempt to salvage it through Tosafot's added stipulation does not save it.
We show how automatically extracted citations in historical corpora can be used to measure the direct and indirect influence of authors on each other. These measures can in turn be used to determine ...an author's overall prominence in the corpus and to identify distinct schools of thought. We apply our methods to two major historical corpora. Using scholarly consensus as a gold standard, we demonstrate empirically the superiority of indirect influence over direct influence as a basis for various measures of authorial impact.
We discuss a real‐world application of a recently proposed machine learning method for authorship verification. Authorship verification is considered an extremely difficult task in computational text ...classification, because it does not assume that the correct author of an anonymous text is included in the candidate authors available. To determine whether 2 documents have been written by the same author, the verification method discussed uses repeated feature subsampling and a pool of impostor authors. We use this technique to attribute a newly discovered Latin text from antiquity (the Compendiosa expositio) to Apuleius. This North African writer was one of the most important authors of the Roman Empire in the 2nd century and authored one of the world's first novels. This attribution has profound and wide‐reaching cultural value, because it has been over a century since a new text by a major author from antiquity was discovered. This research therefore illustrates the rapidly growing potential of computational methods for studying the global textual heritage.