MUL & SISU are committed not simply to corpus linguistics, but to critically analyze modern linguistic analysis techniques to execute a range of research inquiries in the different domains of social sciences. Both intend to focus on initial projects regarding the different fields of research including Language Data Science and Applications, Humanities/Social Sciences (in general), Linguistics and Literature, and English Language Teaching/Learning, (in particular).

MUL & SISU are steadfast to launch a training culture for a variety of new researchers in the field of social sciences. They will be motivated to learn these techniques in corpus studies. This practice will definitely pave grounds for having an excellent community of researchers in Pakistan as well as in China. It can generate sequence of success stories of corpus research in both sides and build an advanced domain: corpus-informed social sciences. 

The MUL & SISU concentrate on the analysis of huge naturally occurring language data('corpora'). In doing so, the use of computer technology like the automatic or computer-aided analysis can create harmony and systematicity in building advanced online archives in the different domains of knowledge for the researchers.


MUL & SISU focus on developing quality standards for several reuse scenarios such as Comparable Corpora, ‘Interpreted Corpora’ Specialized Corpora Multilingual Corpora, Learner Corpora’. Furthermore, other domains like, ‘Language Documentation’, ‘Sign Language’, ‘Language Community’, ‘Ethnography’, ‘Folk Literature and ‘Oral History’, minority, endangered, and indigenous languages. It will develop new methods and execute them in a range of real-world scenarios including the “Endangered Languages Documentation Program”. The project’s main aim is to maximize the potential for reuse and secondary use of audiovisual, annotated language and textual data for both sides researchers. 

Broadly speaking, the project consists of multiple research projects under the umbrella of this proposed Corpus Center. The memorandum of understanding (MOU) shall encompass all members at both centers to broaden their resource bases and mutual benefits. This will involve the mutual interest in the fields of research, development, education, training, and promulgation of knowledge in long term, and recognize the importance of the university’s role in the promotion of knowledge transfer and information sharing including the areas of mutual interests for collaboration. This will further magnify the possibilities of the mutual benefits of contemporary networking techniques, invigorate reciprocity, and disseminate investigative capacity, new methodologies, and technologies.


MUL & SISU will conduct researches by focusing on computational (Corpus) aspects of languages, specifically of Pakistan and other different parts of developing Asia including China. The center will also actively arrange and participate in seminars, workshops, and conferences dedicated to promoting language processing nationally and internationally. This work will be instrumental in the development of computing in different other domains of the Humanities and Social Sciences with special reference to linguistic perspectives. 

MUL & SISU are steadfast to launch a training culture for a variety of new researchers in the field of social sciences