<?xml version='1.0' encoding='UTF-8'?>
<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd">
  <responseDate>2026-03-15T17:26:49Z</responseDate>
  <request metadataPrefix="oai_dc" verb="GetRecord" identifier="oai:ipsj.ixsq.nii.ac.jp:00193907">https://ipsj.ixsq.nii.ac.jp/oai</request>
  <GetRecord>
    <record>
      <header>
        <identifier>oai:ipsj.ixsq.nii.ac.jp:00193907</identifier>
        <datestamp>2025-01-19T23:46:44Z</datestamp>
        <setSpec>581:9633:9634</setSpec>
      </header>
      <metadata>
        <oai_dc:dc xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns="http://www.w3.org/2001/XMLSchema" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd">
          <dc:title>Query Expansion for Microblog Retrieval Focusing on an Ensemble of Features</dc:title>
          <dc:title>Query Expansion for Microblog Retrieval Focusing on an Ensemble of Features</dc:title>
          <dc:creator>Abu, Nowshed Chy</dc:creator>
          <dc:creator>Md, Zia Ullah</dc:creator>
          <dc:creator>Masaki, Aono</dc:creator>
          <dc:creator>Abu, Nowshed Chy</dc:creator>
          <dc:creator>Md, Zia Ullah</dc:creator>
          <dc:creator>Masaki, Aono</dc:creator>
          <dc:subject>[一般論文] microblog search, query expansion, supervised learning, pseudo-relevance feedback, temporal information retrieval, convolutional long short-term memory, expansion term selection, word embedding</dc:subject>
          <dc:description>In microblog search, vocabulary mismatch is a persisting problem due to the brevity of tweets and frequent use of unconventional abbreviations. One way of alleviating this problem is to reformulate the query via query expansion. However, finding good expansion terms for a given query is a challenging task. In this paper, we present a query expansion framework, where supervised learning is adopted for selecting expansion terms. Upon retrieving tweets by our proposed topic modeling based query expansion, we utilize the pseudo-relevance feedback and a new temporal relatedness approach to select the candidate tweets. Next, we devise several new features to select the temporally and semantically relevant expansion terms by leveraging the temporal, word embedding, and sentiment association of candidate term and query. Moreover, we also utilize the lexical and twitter specific features to quantify the term relatedness. After supervised feature selection using regularized regression, we estimate the feature importance by applying random forest. Then, we make use of a learning-to-rank (L2R) framework to rank the candidate expansion terms. Results of extensive experiments on TREC Microblog 2011 and 2012 test collections over the Tweets2011 corpus show that our proposed method outperforms the baseline and competitive query expansion methods.
------------------------------
This is a preprint of an article intended for publication Journal of
Information Processing(JIP). This preprint should not be cited. This
article should be cited as: Journal of Information Processing Vol.27(2019) (online)
DOI　http://dx.doi.org/10.2197/ipsjjip.27.61
------------------------------</dc:description>
          <dc:description>In microblog search, vocabulary mismatch is a persisting problem due to the brevity of tweets and frequent use of unconventional abbreviations. One way of alleviating this problem is to reformulate the query via query expansion. However, finding good expansion terms for a given query is a challenging task. In this paper, we present a query expansion framework, where supervised learning is adopted for selecting expansion terms. Upon retrieving tweets by our proposed topic modeling based query expansion, we utilize the pseudo-relevance feedback and a new temporal relatedness approach to select the candidate tweets. Next, we devise several new features to select the temporally and semantically relevant expansion terms by leveraging the temporal, word embedding, and sentiment association of candidate term and query. Moreover, we also utilize the lexical and twitter specific features to quantify the term relatedness. After supervised feature selection using regularized regression, we estimate the feature importance by applying random forest. Then, we make use of a learning-to-rank (L2R) framework to rank the candidate expansion terms. Results of extensive experiments on TREC Microblog 2011 and 2012 test collections over the Tweets2011 corpus show that our proposed method outperforms the baseline and competitive query expansion methods.
------------------------------
This is a preprint of an article intended for publication Journal of
Information Processing(JIP). This preprint should not be cited. This
article should be cited as: Journal of Information Processing Vol.27(2019) (online)
DOI　http://dx.doi.org/10.2197/ipsjjip.27.61
------------------------------</dc:description>
          <dc:description>journal article</dc:description>
          <dc:date>2019-01-15</dc:date>
          <dc:format>application/pdf</dc:format>
          <dc:identifier>情報処理学会論文誌</dc:identifier>
          <dc:identifier>1</dc:identifier>
          <dc:identifier>60</dc:identifier>
          <dc:identifier>1882-7764</dc:identifier>
          <dc:identifier>AN00116647</dc:identifier>
          <dc:identifier>https://ipsj.ixsq.nii.ac.jp/record/193907/files/IPSJ-JNL6001032.pdf</dc:identifier>
          <dc:language>eng</dc:language>
        </oai_dc:dc>
      </metadata>
    </record>
  </GetRecord>
</OAI-PMH>
