Content Analysis DB > Fields > Source Types > My List ♥ ()

Advanced Algorithms Content Analysis Fields

Source Type: Advanced Algorithms

There are some algorithms, like local sensitivity hashes, that use probabilistic or other means to compute field values. In addition, there are some non-LLM models and algorithms for categorization (although these tend to be less useful except for categorizing to very common values like common news topics).

In Content Chimera

Chimera includes a local sensitivity hash probabilistic algorithm (and index) to find near text duplicates. We have also evaluated a variety of other algorithms and models that have not proven effective enough to include in the product.

See Advanced Algorithms fields below. Or show fields for all field types.
Has [Problem]
Yes or no, does this piece of content have this specific problem? The actual field name would depend on your situation, such as "Has Wall of Text".
General Usefulness:
Ease of Automation:
Compare with other Quality fields.
Near Text Duplicate
Is there a near text duplicate of the page? If so, what is the URL for that near duplicate.
General Usefulness:
Ease of Automation:
Compare with other Quality fields.
[Problem] Count
How often does the problem happen on the page? This would be a specific issue, so something like "Left Nav Count".
General Usefulness:
Ease of Automation:
Compare with other Quality fields.
[Problem] Example
An example of a problem (on a specific page) you are investigating. This field could be repeated in an analysis, with actual fields like "Table Example" or "Bad Character Encoding Example".
General Usefulness:
Ease of Automation:
Compare with other Quality fields.
Topic
The topic/subject of the content.
General Usefulness:
Ease of Automation:
Compare with other Category fields.