lucene-cranfield-search-engine

🔍 A Lucene demo for searching the Cranfield collection.(用于搜索Cranfield数据集的Lucene搜索引擎示例。)

https://github.com/zhuagenborn/lucene-cranfield-search-engine

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (7.0%) to scientific vocabulary

Keywords

cranfield-collection lucene search-engine
Last synced: 6 months ago · JSON representation ·

Repository

🔍 A Lucene demo for searching the Cranfield collection.(用于搜索Cranfield数据集的Lucene搜索引擎示例。)

Basic Info
  • Host: GitHub
  • Owner: Zhuagenborn
  • License: mit
  • Language: Java
  • Default Branch: main
  • Homepage:
  • Size: 561 KB
Statistics
  • Stars: 0
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Topics
cranfield-collection lucene search-engine
Created over 4 years ago · Last pushed almost 2 years ago
Metadata Files
Readme License Citation

README.md

Lucene Cranfield Search Engine

Java Maven License

Introduction

Cover

A Lucene demo for searching the Cranfield collection. It searches for each query in file cran.qry and displays the first two related documents in file cran.all.1400.

```console The results for the query with ID 27: papers on shock-sound wave interaction . SCORE: 7.306487, DOCUMENT ID: 64, TITLE: unsteady oblique interaction of a shock wave with plane disturbances . SCORE: 6.157679, DOCUMENT ID: 291, TITLE: sweepback effects in the turbulent boundary-layer shock-wave interaction .

The results for the query with ID 29: material properties of photoelastic materials . SCORE: 4.421124, DOCUMENT ID: 405, TITLE: tables of thermal properties of gases . SCORE: 4.305917, DOCUMENT ID: 553, TITLE: ablation of glassy materials around blunt bodies of revolution . ```

The workflow is as follows.

workflow

Getting Started

Prerequisites

Building

bash mvn package

Class Diagram

```mermaid classDiagram

namespace Lucene { class LuceneAnalyzer class LuceneSimilarity class LuceneScoreDoc class LuceneDocument class LuceneIndexSearcher }

namespace Cran { class CranField { Field id$ Field title$ Field author$ Field bibliography$ Field words$

    String tag
    String name
}

class CranDocument {
    int id
    String title
    String author
    String bibliography
    String words
}

class CranParser {
    parseDocuments(text)$ List~CranDocument~
    parseQueries(text)$ List~CranQuery~
}

class CranQuery {
    int id
    String words
}

}

CranParser ..> CranDocument CranParser ..> CranQuery

class Indexer { index(List~CranDocument~, path) }

Indexer --> LuceneAnalyzer Indexer --> LuceneSimilarity Indexer ..> CranField Indexer ..> CranDocument

class Searcher { search(CranQuery, top) List~LuceneScoreDoc~ documents(List~LuceneScoreDoc~) List~LuceneDocument~ document(LuceneScoreDoc) LuceneDocument }

Searcher --> LuceneIndexSearcher Searcher --> LuceneAnalyzer Searcher ..> LuceneScoreDoc Searcher ..> LuceneDocument Searcher ..> CranField Searcher ..> CranQuery ```

License

Distributed under the MIT License. See LICENSE for more information.

Owner

  • Name: Zhuagenborn
  • Login: Zhuagenborn
  • Kind: organization
  • Location: Ireland

Software Development | Artificial Intelligence | Reverse Engineering.

Citation (CITATION.cff)

cff-version: 1.2.0
authors:
- family-names: Liu
  given-names: Guowen
  orcid: https://orcid.org/0000-0002-8375-5729
- family-names: Chen
  given-names: Zhenshuo
  orcid: https://orcid.org/0000-0003-2091-4160
title: Lucene Cranfield Search Engine
date-released: 2022-11-21
url: https://github.com/Zhuagenborn/Lucene-Cranfield-Search-Engine

GitHub Events

Total
Last Year

Issues and Pull Requests

Last synced: 9 months ago

All Time
  • Total issues: 0
  • Total pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Total issue authors: 0
  • Total pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels