Questions LOCOST - ufal/NPFL095 GitHub Wiki

What other ways (at least one) can we use to achieve less than asymptotically quadratic time complexity (o(n^2)) as a function of context window size? Some have been shown before but if you have no other ideas or want to see a new one: https://aclanthology.org/2022.findings-naacl.55/
Estimate which sentence sets could be chosen by the GSG as the pseudo summary for k = 1 from the document sentence set:

{"cats playing", "dogs sleeping", "cats dogs in house"}

Check by calculating the explicit average ROUGE-1F1 scores with the rest of the document for each sentence.

ROUGE-1recall= percentage of unigrams from reference contained in candidate

ROUGE-1precision= percentage of unigrams from candidate contained in reference (used to be percentage of unigrams from reference contained in candidate, solutions using this wrong definition are also valid)

ROUGE-1F1= 2⋅recall⋅precision / (recall+precision)

F1 score is 0 if both precision and recall are 0.

let us have

A = 
1.0 0.0
0.0 2.0

b = (2, 1)^T
c = (1, 3)^T
u = (1, 2)^T
d = 0

Calculate the vector y as in eq. 2 using the definition of circular convolution in eq. 6 (we modulo the indexes as we would with python indexing).

Calculate U and K, the discrete Fourier transforms of u and k. Calculate the inverse DFT of the point-wise product of UK.

bonus: Show that in general multiplying 2d vectors in the spectral domain and then transforming them back is the same as calculating their convolution.

hint: to get the vector z containing the discrete Fourier transform of y we use:

z1 = y1 + y2
z2 = y1 − y2
y1 = (z1 + z2) / 2
y2 = (z1 - z2) / 2