Discussions and future works
===
As the first exploration in end-to-end approaches to spoken question answering, the result of our experiments shows the feasibility of this research direction. While reasonable performance can be achieved by this approach, there is a large room for future research with the following issues addressed.
The first issue to address is the usage of word boundaries. Although it is legal to use an off-the-shelf ASR model that acts as a segmenter under the supervised se