afs_live_acc - AFS - Reference Guides

AFS Filters Description

Product
AFS
AFS_Version
7.12
Category
Reference Guides
language
English

The afs_live_acc filter computes the proximity of a document to every other documents from an external corpus. This computation generates data which are called ACC (Automatic Cross Content). The proximity is a number between 0 and 1. Two documents with a proximity close to 0 are totally different, and two documents with a proximity close to 1 are semantically very similar.

The filter is declared with the afs_live_acc type. It is in the antidot-paf-misc package. It is a processor filter.

This filter can be instantiated only once at any given moment. It will not read the "instances" parameter in the configuration.

The Live ACC filter specifications are described in the following table:

Parameter name

Mandatory

Type

Default

Description

input_layer

No

layer

CONTENTS

The layer from wich the data is read

min_proximity

No

float

0.1

Documents with a proximity below this value will be rejected.

min_words

No

integer

10

Documents with words count below this value will be rejected.

max_nb_neighbours

No

integer

4

The maximum number of closest documents.

nsmap

No

map

Empty map

Namespaces used to interpret the given xpath.

output_format

No

string

XML

Format into which the output will be serialized.

output_layer

Yes

layer

N/A

The layer where to expose the computed neighbours.

user_rule

No

string

N/A

LUA script used for filtering ACC data.

var_layer

No

layer

CONTENTS

It is the layer where variables declared by the variable parameter are located.

variables

No

map

Empty map

User generated variables to be used with user_rule parameter (for each, its name and its xpath).

Output example:
<?xml version="1.0" encoding="UTF-8" standalone="yes"?>
<afs:AccPaasResult xmlns:afs="http://ref.antidot.net/v7/afs#">
  <afs:neighbour uri="/uri/of/the/neighbour" title="The title of the neighbour" proximity="0.928"/>
  <afs:neighbour ... />
  ...
</afs:AccPaasResult>