Skip to content

This is a web frontend for a zotero database which makes your literature database easily publicly accessible

License

Notifications You must be signed in to change notification settings

Tomptez/Bibfilter

Repository files navigation

Bibfilter

A Searchable Literature database web-interface for use with Zotero.

It is developed using python and flask and can easily be deployed using Dokku.

For advanced searching features, Elasticsearch can be used as a backend for searching.

Screenshot

Background

It was developed specifically as part of the German Science Foundation funded project "The Reciprocal Relationship of Public Opinion and Social Policy" (BR 5423/2-1). As such it was designed to provide an interactive tool for searching through all known literature in the specific subject area of public preferences for social policy and redistribution. However, it can easily be deployed with any literature library contained in a Zotero folder. A current version of bibfilter is used on the webpage of the Social Policy Preferences Network with the server hosted by the University of Bremen's Bremen International Graduate School of Social Sciences.

Table of contents

  1. How to run the application using Docker Compose

  2. Bulding the application manually

    2.1 Prerequisites

    2.2 Installation

    2.3 Setting up environment variables

    2.4 Normal Usage

  3. Deploying the application via Dokku

    3.1 Setting up the application for dokku

    3.2 Updating the application remotely

    3.3 Restarting the Application

How to run the application using Docker Compose

Using docker-compose is the easiest option to run bibfilter as the provided docker-compose.yml shows an example how to run bibfilter with a complete setup including postgresql and elasticsearch. For that you need to have Docker and docker-compose installed (For ubuntu based systems, we recommend using the package 'docker-compose-v2'). After pulling the repository run

cp example.env .env

And edit the .env file so that it reflects your zotero library and other variables according to your needs (For a more detailed explanation see Setting up environment variables).

After that, simply run

docker compose build

To build the application. You can then run the application by running

docker compose up

The provided docker-compose.yml provides a tested example configuration, but should be configured according to your needs before using it in a production environment.

Bulding the application manually

Prerequisites

First make sure you have installed git, Bash (which comes with git on windows) and Python version 3.9 or newer. You also need pip which will usually be installed together with python on Windows and Mac. You will also need PostgreSQL to create the database. For Mac, we recommend you to use Postgres.app. Depending on your system you may need Microsoft Visual C++ installed and/or updated.

Elasticsearch for testing

Optionally, if you want to make use of advanced searching capabilities and search through the fulltext of the articles, Elasticsearch needs to be installed - Windows instructions here.

When using docker this can be done as follows (note that this installation is not a sufficient for production and should only be used for testing)

docker pull docker.elastic.co/elasticsearch/elasticsearch:8.8.0
docker volume create es-data

Now you can run elasticsearch as follows

docker run -p 127.0.0.1:9200:9200 -p 127.0.0.1:9300:9300 -e "discovery.type=single-node" -e "xpack.security.enabled=false" --mount source=es-data,target=/usr/share/elasticsearch/data docker.elastic.co/elasticsearch/elasticsearch:8.8.0

Installation

Next, clone the repository to your local machine from your (git) bash terminal

git clone https://github.com/Tomptez/bibfilter.git

and then open the cloned directory

cd bibfilter

The next step is to create a virtual environment and install all the required modules using venv. First, create a new virtual environment called 'venv' and activate it.

python -m venv venv

MacOS and Linux

source venv/bin/activate

Windows

venv\Scripts\activate.bat

Note that "python" or "py" commands may not run. If this is the case it is likely due to one or both of the following. (A) Windows 10 default launching location for Python (and every app) is the AppData folder. So it tries to launch an app. If you installed Python from Python.org (recommended) rather than the Microsoft Store (not recommended) then you need to change the computer's environment variables. This is done by clicking the 'start' button which is now the Windows icon in the lower left corner, and type "Env". What should pop up is "Edit the system environment variables". Click on this, then click "EnvironmentVariables...", then select "Path" under "User variables" in the list and click "Edit". Create a "New" path and put in the exact directory where Python was installed (e.g. C:\Python311). then select this and click "Move Up" until it is at the top. (B) You need to turn of auto app launching. Go to "Apps" and click "App execution aliases". Turn off all App Installers associated with Python (there are usually two of them). After doing this, restart and try again.

You should now see (venv) in your command line. Now we need to make sure pip is up to date

pip install --upgrade pip

After that we initiate the virtual environment by installing the exact packages and versions needed. This can take a while.

pip install -r requirements.txt

To use the application, you will also need to create a PostgreSQL Database called bibfilter and install an extension called unaccent. This is done by typing in your command line cmd (this may or may not work in git bash)

psql -U postgres

If that does not work, you can specify the exact filepath of your postgres installation (for example)

"C:\Program Files\PostgreSQL\12\bin\psql.exe" -U postgres

Or, you can follow the instructions above for setting Python PATH in your Windows Environment, and add two new lines that point directly at the \bin and \lib directories in your PostgreSQL[version#]\ folder

For Mac, this is done by typing in your git bash

psql -d postgres

To create the database you type

CREATE DATABASE bibfilter;

Then you can select the database and install the unaccent extension

\c bibfilter;
CREATE EXTENSION unaccent;

Setting up environment variables


NOTE

To open bash in Windows close the cmd window and type "bash" into the start icon. Then make sure to navigate to the bibfilter folder, e.g. by typing

cd C:\bibfilter

Before we can start the application, we need to set our environment variables.

To do so, we need a new file in the main directory (in which the Pipfile sits) called .env where you define the environment variables which we will use in the project. There is an example.env file we can use as a template. Let's copy it as our real .env file and edit it. We will use the terminal editor nano. In your bash terminal type

cp example.env .env
nano .env

You can change DATABASE_URL, so that postgres is your PostgreSQL username (postgres is the standard user which you can keep) and mypassword with the password that you chose for that user during the install.

For macOS, you can change DATABASE_URL, so that it matches with the connection URL that PostgreSQL server provides for the database bibfilter. This URL requires you to input the username postgres and the password you specified. In Windows for example this looks like "postgresql://postgres:[password you set previously]@localhost:5432/bibfilter". If you start your password with '@' it will not work!

You can then change the values of APP_USERNAME and APP_PASSWORD to whatever you like. They will be used for the login for the /admin page. LIBRARY_ID and COLLECTION_ID should reflect the respective ID of your zotero Library and collection. You can retrieve these IDs from the adress field in your browser if you open the collection at zotero.org in your browser. Note that it has to be a public library, otherwise you also need to use an API-Key which this application does not yet account for. SUGGEST_LITERATURE_URL should be the URL to a page where users can suggest articles to add.

If you also want to use elasticsearch, depending on your setup you might need to change the URL of your elasticsearch instance ELASTICSEARCH_URL, or might need to specify ELASTICSEARCH_PASSWORD or ELASTICSEARCH_USERNAME. In that case, uncomment and change these lines accordingly.

You can now close the nano editor by hitting Ctr+X and then typing Y and then Enter to save the file.

You could now already open the application but it will not show any articles because we have no database yet. To create the database you need run update_library.py once

python update_library.py

Note: if you are using a more recent version of Python3 you might have to update some of the requirements (schedule, pytz, etc), just type pip install [name of package]

If you see a report (Summary of synchronization with Zotero), you can stop the process using Ctrl + C. If there's any errors, there is probably an issue with your postgreql setup. If not, you have successfully setup the database and can now start the application with

python app.py

In you browser, you can now find the page at the URL http://127.0.0.1:5000/ If you want to close the server, you can hit Ctr+C

If your are finished, you can close the virtual environment by typing

deactivate

or just close the terminal.

Normal Usage

After the initial setup when you want to start the application, you should first open the repository folder in your bash terminal.

Then you can check whether there have been any updates in the gitlab repository and if there is, download them by running

git pull

to start the application, you always need to launch the virtual environment first

source venv/bin/activate

After that, you should see a (venv) in your command prompt which indicates the venv is active. You can exit the venv by typing exit

After activating the virtual environment, you can start the flask app with

python app.py

Deploying the application via Dokku

There are several ways to put the bibfilter application online. Here we will explain how to get bibfilter running on a servere using dokku.

1. Setting up the application for dokku

This assumes you have got a running instance of dokku on your server.

First, you need to create a new dokku application

dokku apps:create my_application_name

Next, you need to install postgres on dokku and create a new database. Afterwards you need to connect to the SQL interface, add the extension unaccent and then link the database to the application.

sudo dokku plugin:install https://github.com/dokku/dokku-postgres.git
dokku postgres:create my_database_name
dokku postgres:connect my_database_name
CREATE EXTENSION unaccent;
exit
dokku postgres:link my_database_name my_application_name

For flask_limiter, you also need to create a memcached service as a backend and link it to the application

sudo dokku plugin:install https://github.com/dokku/dokku-memcached.git memcached
dokku memcached:create memcached_backend
dokku memcached:link memcached_backend my_application_name

After that, you need to create the required environment variables. Of course you need to change it to the correct values. Note: This command is supposed to be one line not seperate lines.

dokku config:set my_application_name APP_USERNAME=my_admin_name APP_PASSWORD=my_password LIBRARY_ID=my_public_zotero_library_id COLLECTION_ID=my_public_zotero_collection_id SUGGEST_LITERATURE_URL=url_of_form_where_one_can_suggest_an_article SHOW_SEARCH_QUOTES = "TRUE" USE_ELASTICSEARCH = "FALSE"

Lastly, we want to add the letsencrypt plugin to the application, so that our application runs on a secure connection

sudo dokku plugin:install https://github.com/dokku/dokku-letsencrypt.git
dokku config:set --no-restart my_application_name [email protected]
dokku letsencrypt my_application_name

We also want to setup letsencrypt to automatically update

With dokku it easy to create and renew the certificates. The documentation can be found here

To renew all certificates once, just log into the server and run

dokku letsencrypt:auto-renew

To make dokku renew certificates automatically, run

dokku letsencrypt:cron-job --add


2. Updating the application remotely

After you have set up the aplication on the dokku instance, you can basically use it the way you use a regular git repository and easily add any changes. On your local computer you want to add the server and application-name as a git repository

git remote add remote_name [email protected]:my_application_name

Change 'remote_name' to whatever you like. Then you can easily push the application using git. You do however need to make sure that you have the right to push to dokku. For that, you might need an SSH-Key, which possibly has to be added to the whitelist of the server as well as to the whitelist of dokku. Depending on the setup of the server you might also need to be connected to a specific network (for example via VPN).

git push remote_name main

If there are problems online, you might want to read to logs of your running application

3. Restarting the Application

In case of issues a restart may be necessary

  1. Login to the server using SSH key.
    • Open command prompt.
    • Connect to the server by typing (you need to be in the university or be connected to the VPN)
    • You might have to enter the password you chose when setting the ssh key up
  1. Check the logs
dokku logs my_application_name --tail
  1. Restart
dokku ps:start bibfilter

Contributors

Tom Knuf, Nate Breznau, Karen Kuribayashi

About

This is a web frontend for a zotero database which makes your literature database easily publicly accessible

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •