CEPIC: The Chinese/English Political Interpreting Corpus

Project: Digital scholarship project

Project Details


Project co-developed with the University Library.

The CEPIC consists of transcripts of speeches delivered by top political figures from Hong Kong, Beijing, Washington DC and London, as well as their translated/interpreted texts.

The main speech types in the CEPIC are readings of government reports (such as policy addresses and budget speeches), Q&A sessions at press conferences, parliamentary debates, and remarks delivered at bilateral meetings.

The corpus displays up to six versions of the same speech segment in parallel, aligned at paragraph level. In addition to part-of-speech (POS) tagging, it is annotated with prosodic and paralinguistic features relevant to the study of spoken language and interpreting.
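As an illustration only, paragraph-level alignment of parallel versions could be modelled roughly as follows. This is a minimal sketch, not the CEPIC's actual data model: the class name `AlignedSegment`, the `search` helper, the version labels and the sample sentences are all hypothetical and are not taken from the corpus.

```python
from dataclasses import dataclass, field
from typing import Dict, List

MAX_VERSIONS = 6  # the page states that up to six versions are shown per segment


@dataclass
class AlignedSegment:
    """One paragraph-level segment with its parallel versions (hypothetical model)."""
    segment_id: str
    # Maps an assumed version label (e.g. "en_source") to that version's paragraph text.
    versions: Dict[str, str] = field(default_factory=dict)

    def add_version(self, label: str, text: str) -> None:
        """Add or overwrite a version, refusing a seventh distinct label."""
        if label not in self.versions and len(self.versions) >= MAX_VERSIONS:
            raise ValueError(f"a segment holds at most {MAX_VERSIONS} versions")
        self.versions[label] = text


def search(segments: List[AlignedSegment], keyword: str, label: str) -> List[AlignedSegment]:
    """Return segments whose `label` version contains `keyword`, ignoring case."""
    needle = keyword.lower()
    return [s for s in segments if needle in s.versions.get(label, "").lower()]


# Placeholder data, invented for this sketch.
segments = [
    AlignedSegment("demo-001", {
        "en_source": "We will invest in housing.",
        "zh_interpreted": "我們會投資房屋。",
    }),
    AlignedSegment("demo-002", {
        "en_source": "Thank you for the question.",
        "zh_interpreted": "多謝你的提問。",
    }),
]

hits = search(segments, "housing", "en_source")
for hit in hits:
    # Each hit carries every aligned version, so they can be shown side by side.
    print(hit.segment_id, hit.versions)
```

Because each hit keeps all of its aligned versions together, a keyword found in one version retrieves the corresponding paragraph in every other version.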

The CEPIC can be used to investigate Chinese/English political translation and interpreting, as well as political discourse at large. It can also support students, teachers and people working in political settings with political speech delivery and with translation and interpreting. Users can download search results from the corpus for their own teaching or research purposes.

Effective start/end date: 12/03/18 to 20/06/19

UN Sustainable Development Goals

In 2015, UN member states agreed to 17 global Sustainable Development Goals (SDGs) to end poverty, protect the planet and ensure prosperity for all. This project contributes towards the following SDG(s):

  • SDG 4 - Quality Education
  • SDG 9 - Industry, Innovation, and Infrastructure
  • SDG 10 - Reduced Inequalities
  • SDG 16 - Peace, Justice and Strong Institutions
  • SDG 17 - Partnerships for the Goals

