关于elasticsearch:48elasticsearch搜索引擎用Django实现搜索功能

39次阅读

共计 14041 个字符,预计需要花费 36 分钟才能阅读完成。

【百度云搜寻,搜各种材料:http://www.lqkweb.com】
【搜网盘,搜各种材料:http://www.swpan.cn】
【酷站群,搜各种网站源码:http://www.kuzq.cn】

Django 实现搜寻性能

1、在 Django 配置搜寻后果页的路由映射

"""pachong URL Configuration

The `urlpatterns` list routes URLs to views. For more information please see:
    https://docs.djangoproject.com/en/1.10/topics/http/urls/
Examples:
Function views
    1. Add an import:  from my_app import views
    2. Add a URL to urlpatterns:  url(r'^/pre>, views.home, name='home')
Class-based views
    1. Add an import:  from other_app.views import Home
    2. Add a URL to urlpatterns:  url(r'^/pre>, Home.as_view(), name='home')
Including another URLconf
    1. Import the include() function: from django.conf.urls import url, include
    2. Add a URL to urlpatterns:  url(r'^blog/', include('blog.urls'))
"""
from django.conf.urls import url
from django.contrib import admin
from app1 import views

urlpatterns = [url(r'^admin/', admin.site.urls),
    url(r'^/pre>, views.indexluoji),
    url(r'^index/', views.indexluoji),
    url(r'^suggest//pre>, views.suggestluoji,name="suggest"),     # 搜寻字段补全申请
    url(r'^search//pre>, views.searchluoji,name="search"),     # 搜寻

]

2、编写逻辑处理函数

在逻辑处理函数里实现搜寻数据

(1)获取到用户的搜索词

(2)利用原生的 elasticsearch(搜索引擎)接口,实现搜寻,注明:elasticsearch-dsl 就是在原生的 elasticsearch 上做了封装

Elasticsearch()办法,连贯原生的 elasticsearch 服务器

search()办法,原生 elasticsearch 查询方法,反对原生的 elasticsearch 查问语句,返回的原生语句后果也就是字典模式的数据

在查问语句里进行关键词高亮解决

将查问到的后果,循环获取到后返回到 html 页面

from django.shortcuts import render

# Create your views here.
from django.shortcuts import render,HttpResponse
from django.views.generic.base import View
from app1.models import lagouType               # 导入操作 elasticsearch(搜索引擎)类
import json
from elasticsearch import Elasticsearch         # 导入原生的 elasticsearch(搜索引擎)接口
client = Elasticsearch(hosts=["127.0.0.1"])     # 连贯原生的 elasticsearch

def indexluoji(request):
    print(request.method)  # 获取用户申请的门路
    return render(request, 'index.html')

def suggestluoji(request):                                      # 搜寻主动补全逻辑解决
    key_words = request.GET.get('s', '')                        # 获取到申请词
    re_datas = []
    if key_words:
        s = lagouType.search()                                  # 实例化 elasticsearch(搜索引擎)类的 search 查问
        s = s.suggest('my_suggest', key_words, completion={
            "field": "suggest", "fuzzy": {"fuzziness": 1},
            "size": 5
        })
        suggestions = s.execute_suggest()
        for match in suggestions.my_suggest[0].options:
            source = match._source
            re_datas.append(source["title"])
    return HttpResponse(json.dumps(re_datas), content_type="application/json")

def searchluoji(request):                                       # 搜寻逻辑解决
    key_words = request.GET.get('q', '')                        # 获取到申请词
    response = client.search(# 原生的 elasticsearch 接口的 search()办法,就是搜寻,能够反对原生 elasticsearch 语句查问
        index="lagou",                                          # 设置索引名称
        doc_type="biao",                                        # 设置表名称
        body={                                                  # 书写 elasticsearch 语句
            "query": {
                "multi_match": {                                # multi_match 查问
                    "query": key_words,                         # 查问关键词
                    "fields": ["title", "description"]          # 查问字段
                }
            },
            "from": 0,                                          # 从第几条开始获取
            "size": 10,                                         # 获取多少条数据
            "highlight": {                                      # 查问关键词高亮解决
                "pre_tags": ['<span class="keyWord">'],         # 高亮开始标签
                "post_tags": ['</span>'],                       # 高亮完结标签
                "fields": {                                     # 高亮设置
                    "title": {},                                # 高亮字段
                    "description": {}                           # 高亮字段}
            }
        }
    )
    total_nums = response["hits"]["total"]                      # 获取查问后果的总条数
    hit_list = []                                               # 设置一个列表来贮存搜寻到的信息,返回给 html 页面
    for hit in response["hits"]["hits"]:                        # 循环查问到的后果
        hit_dict = {}                                           # 设置一个字典来贮存循环后果
        if "title" in hit["highlight"]:                         # 判断 title 字段,如果高亮字段有类容
            hit_dict["title"] = "".join(hit["highlight"]["title"])      # 获取高亮里的 title
        else:
            hit_dict["title"] = hit["_source"]["title"]                 # 否则获取不是高亮里的 title

        if "description" in hit["highlight"]:                           # 判断 description 字段,如果高亮字段有类容
            hit_dict["description"] = "".join(hit["highlight"]["description"])[:500]    # 获取高亮里的 description
        else:
            hit_dict["description"] = hit["_source"]["description"]     # 否则获取不是高亮里的 description

        hit_dict["url"] = hit["_source"]["url"]                         # 获取返回 url

        hit_list.append(hit_dict)                                       # 将获取到内容的字典,增加到列表
    return render(request, 'result.html', {"all_hits": hit_list, "key_words": key_words})       #显示页面和将列表和搜索词返回到 html

3、html 页面接管搜寻后果

留神:因为 Django 实现了避免恶意代码写入,但凡通过变量传输到 html 页面的 html 类型代码,将会被主动转换成字符串形式显示,索引咱们须要在接管变量的字段用:{% autoescape off %} {{接管变量}} {% endautoescape %},来显示 html 代码,

搜寻后因为进行了一次跳转,所以搜寻框里的搜索词将不存在,咱们须要在传递搜寻后果到页面的时候,将搜索词也传递进来填充到搜 索框

<!DOCTYPE html >
<html xmlns="http://www.w3.org/1999/xhtml">
{# 引入动态文件门路 #}
{% load staticfiles %}
<head>
<meta http-equiv="X-UA-Compatible" content="IE=emulateIE7" />
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
<title>python-lcv-search 搜索引擎 </title>
<link href="{% static'css/style.css'%}" rel="stylesheet" type="text/css" />
<link href="{% static'css/result.css'%}" rel="stylesheet" type="text/css" />
</head>
<body>
<div id="container">
    <div id="hd" class="ue-clear">
        <a href="/"><div class="logo"></div></a>
        <div class="inputArea">
            <input type="text" class="searchInput" value="{{key_words}}"/>
            <input type="button" class="searchButton" onclick="add_search()"/>
        </div>
    </div>
    <div class="nav">
        <ul class="searchList">
            <li class="searchItem current" data-type="article"> 文章 </li>
            <li class="searchItem" data-type="question"> 问答 </li>
            <li class="searchItem" data-type="job"> 职位 </li>
        </ul>
    </div>
    <div id="bd" class="ue-clear">
        <div id="main">
            <div class="sideBar">

                <div class="subfield"> 网站 </div>
                <ul class="subfieldContext">
                    <li>
                        <span class="name"> 伯乐在线 </span>
                        <span class="unit">(None)</span>
                    </li>
                    <li>
                        <span class="name"> 知乎 </span>
                        <span class="unit">(9862)</span>
                    </li>
                    <li>
                        <span class="name"> 拉勾网 </span>
                        <span class="unit">(9862)</span>
                    </li>
                    <li class="more">
                        <a href="javascript:;">
                            <span class="text"> 更多 </span>
                            <i class="moreIcon"></i>
                        </a>
                    </li>
                </ul>

                <div class="sideBarShowHide">
                    <a href="javascript:;" class="icon"></a>
                </div>
            </div>
            <div class="resultArea">
                <p class="resultTotal">
                    <span class="info"> 找到约 &nbsp;<span class="totalResult">45</span>&nbsp; 条后果(用时 <span class="time">0.643128</span> 秒),共约 <span class="totalPage">5</span> 页 </span>
                </p>
                <div class="resultList">
                    {% for hit in all_hits %}
                    <div class="resultItem">
                            <div class="itemHead">
                                <a href="{% autoescape off %} {{hit.url}} {% endautoescape %}"  target="_blank" class="title">{% autoescape off %} {{hit.title}} {% endautoescape %}</a>
                                <span class="divsion">-</span>
                                <span class="fileType">
                                    <span class="label"> 起源:</span>
                                    <span class="value"> 网络 </span>
                                </span>
                                <span class="dependValue">
                                    <span class="label"> 得分:</span>
                                    <span class="value">3.401155</span>
                                </span>
                            </div>
                            <div class="itemBody">
                                {% autoescape off %} {{hit.description}} {% endautoescape %}
                            </div>
                        </div>
                    {% endfor %}
                </div>
                <!-- 分页 -->
                <div class="pagination ue-clear"></div>
                <!-- 相干搜寻 -->

            </div>
            <div class="historyArea">
                <div class="hotSearch">
                    <h6> 热门搜寻 </h6>
                    <ul class="historyList">

                            <li><a href="/search?q=linux">linux</a></li>

                    </ul>
                </div>
                <div class="mySearch">
                    <h6> 我的搜寻 </h6>
                    <ul class="historyList">

                    </ul>
                </div>
            </div>
        </div><!-- End of main -->
    </div><!--End of bd-->
</div>

<div id="foot">Copyright &copy;projectsedu.com 版权所有  E-mail:admin@projectsedu.com</div>
</body>
<script type="text/javascript" src="{% static'js/jquery.js'%}"></script>
<script type="text/javascript" src="{% static'js/global.js'%}"></script>
<script type="text/javascript" src="{% static'js/pagination.js'%}"></script>
<script type="text/javascript">
    var search_url = "/search/"

    $('.searchList').on('click', '.searchItem', function(){$('.searchList .searchItem').removeClass('current');
        $(this).addClass('current');    
    });

    $.each($('.subfieldContext'), function(i, item){$(this).find('li:gt(2)').hide().end().find('li:last').show();});

    function removeByValue(arr, val) {for(var i=0; i<arr.length; i++) {if(arr[i] == val) {arr.splice(i, 1);
          break;
        }
      }
    }
    $('.subfieldContext .more').click(function(e){var $more = $(this).parent('.subfieldContext').find('.more');
        if($more.hasClass('show')){if($(this).hasClass('define')){$(this).parent('.subfieldContext').find('.more').removeClass('show').find('.text').text('自定义');
            }else{$(this).parent('.subfieldContext').find('.more').removeClass('show').find('.text').text('更多');    
            }
            $(this).parent('.subfieldContext').find('li:gt(2)').hide().end().find('li:last').show();}else{$(this).parent('.subfieldContext').find('.more').addClass('show').find('.text').text('收起');
            $(this).parent('.subfieldContext').find('li:gt(2)').show();}

    });

    $('.sideBarShowHide a').click(function(e) {if($('#main').hasClass('sideBarHide')){$('#main').removeClass('sideBarHide');
            $('#container').removeClass('sideBarHide');
        }else{$('#main').addClass('sideBarHide');    
            $('#container').addClass('sideBarHide');
        }

    });
    var key_words = "java"
    // 分页
    $(".pagination").pagination(45, {
        current_page :0, // 以后页码
        items_per_page :10,
        display_msg :true,
        callback :pageselectCallback
    });
    function pageselectCallback(page_id, jq) {window.location.href=search_url+'?q='+key_words+'&p='+page_id}

    setHeight();
    $(window).resize(function(){setHeight();    
    });

    function setHeight(){if($('#container').outerHeight() < $(window).height()){$('#container').height($(window).height()-33);
        }    
    }
</script>
<script type="text/javascript">
    $('.searchList').on('click', '.searchItem', function(){$('.searchList .searchItem').removeClass('current');
        $(this).addClass('current');
    });

    // 联想下拉显示暗藏
    $('.searchInput').on('focus', function(){$('.dataList').show()});

    // 联想下拉点击
    $('.dataList').on('click', 'li', function(){var text = $(this).text();
        $('.searchInput').val(text);
        $('.dataList').hide()});

    hideElement($('.dataList'), $('.searchInput'));
</script>
<script>
    var searchArr;
    // 定义一个 search 的,判断浏览器有无数据存储(搜寻历史)if(localStorage.search){
        // 如果有,转换成 数组的模式寄存到 searchArr 的数组里(localStorage 以字符串的模式存储,所以要把它转换成数组的模式)searchArr= localStorage.search.split(",")
    }else{
        // 如果没有,则定义 searchArr 为一个空的数组
        searchArr = [];}
    // 把存储的数据显示进去作为搜寻历史
    MapSearchArr();

    function add_search(){var val = $(".searchInput").val();
        if (val.length>=2){
            // 点击搜寻按钮时,去重
            KillRepeat(val);
            // 去重后把数组存储到浏览器 localStorage
            localStorage.search = searchArr;
            // 而后再把搜寻内容显示进去
            MapSearchArr();}

        window.location.href=search_url+'?q='+val+"&s_type="+$(".searchItem.current").attr('data-type')

    }

    function MapSearchArr(){
        var tmpHtml = "";
        var arrLen = 0
        if (searchArr.length > 6){arrLen = 6}else {arrLen = searchArr.length}
        for (var i=0;i<arrLen;i++){tmpHtml += '<li><a href="/search?q='+searchArr[i]+'">'+searchArr[i]+'</a></li>'
        }
        $(".mySearch .historyList").append(tmpHtml);
    }
    // 去重
    function KillRepeat(val){
        var kill = 0;
        for (var i=0;i<searchArr.length;i++){if(val===searchArr[i]){kill ++;}
        }
        if(kill<1){searchArr.unshift(val);
        }else {removeByValue(searchArr, val)
            searchArr.unshift(val)
        }
    }
</script>
</html>

最终成果

正文完
 0