Abstract
Small RNAs (sRNAs) are vital regulators of gene expression and are involved in various biological processes. Among them, microRNAs (miRNAs) and phased small interfering RNAs (phasiRNAs) have been well defined and studied over the past decades. Numerous scripts and pipelines have been developed to annotate miRNAs and phasiRNAs. However, some computational annotations are coarse and lack careful manual checking, resulting in low-quality annotation. In this study, 19 public sRNA sequencing datasets from nine different tissues of strawberry (Fragaria vesca) were collected to annotate miRNAs and PHAS loci in F. vesca. After bioinformatics analysis and careful manual checking, 167 known miRNAs, 27 miRNA*s with notable abundance, and 54 novel miRNAs were accurately annotated. The names of two miRNAs were corrected from miR477b and miR5225 to miRN47 and miR3627h, respectively. In addition to the 21-nucleotide (nt) miR390, eleven 22-nt miRNAs trigger the biogenesis of 21-nt phasiRNAs from 110 PHAS loci in strawberry. In particular, we found that several PHAS loci are targeted by two different miRNAs (similar to the “two-hit” model), with the phasiRNA-generating region located between the two target sites. We speculate that one target site triggers phasiRNA biogenesis and the other defines the boundary of the phasiRNA-generating region, which likely makes phasiRNA generation more precise. Overall, we provide a comprehensive and accurate annotation of miRNAs and PHAS loci in the F. vesca genome.