从ifstream读取不会读取空格-Readingfromifstreamwon'treadwhitespace

作者：执子之手2502891083 | 来源：互联网 | 2024-10-11 11:26

ImimplementingacustomlexerinC++andwhenattemptingtoreadinwhitespace,theifstreamwont

I'm implementing a custom lexer in C++ and when attempting to read in whitespace, the ifstream won't read it out. I'm reading character by character using >>, and all the whitespace is gone. Is there any way to make the ifstream keep all the whitespace and read it out to me? I know that when reading whole strings, the read will stop at whitespace, but I was hoping that by reading character by character, I would avoid this behaviour.

我正在用C ++实现一个自定义词法分析器,当试图读取空格时,ifstream将不会读取它。我正在使用>>逐字逐句阅读,所有的空白都消失了。有没有什么方法可以让ifstream保留所有的空格并将它读出来给我?我知道在阅读整个字符串时,读取将停留在空白处,但我希望通过逐字逐句阅读,我会避免这种行为。

Attempted: .get(), recommended by many answers, but it has the same effect as std::noskipws, that is, I get all the spaces now, but not the new-line character that I need to lex some constructs.

尝试:.get(),由许多答案推荐,但它与std :: noskipws具有相同的效果,也就是说,我现在获得所有空格,但不是我需要使用某些结构的新行字符。

Here's the offending code (extended comments truncated)

这是违规代码(扩展注释被截断)

while(input >> current) {
    always_next_struct val = always_next_struct(next);
    if (current == L' ' || current == L'\n' || current == L'\t' || current == L'\r') {
        continue;
    }
    if (current == L'/') {
        input >> current;
        if (current == L'/') {
            // explicitly empty while loop
            while(input.get(current) && current != L'\n');
            continue;
        }

I'm breaking on the while line and looking at every value of current as it comes in, and \r or \n are definitely not among them- the input just skips to the next line in the input file.

我正在打破while行并查看当前的每个值,而\ r或\ n肯定不在其中 - 输入只是跳到输入文件中的下一行。

8 个解决方案

#1

There is a manipulator to disable the whitespace skipping behavior:

有一个操纵器可以禁用空格跳过行为:

stream >> std::noskipws;

#2

The operator>> eats whitespace (space, tab, newline). Use yourstream.get() to read each character.

运算符>>吃空格(空格,制表符,换行符)。使用yourstream.get()读取每个字符。

Edit:

Beware: Platforms (Windows, Un*x, Mac) differ in coding of newline. It can be '\n', '\r' or both. It also depends on how you open the file stream (text or binary).

注意:平台(Windows,Un * x,Mac)在换行编码方面有所不同。它可以是'\ n','\ r'或两者。它还取决于您打开文件流(文本或二进制)的方式。

Edit (analyzing code):

编辑(分析代码):

After

  while(input.get(current) && current != L'\n');
  continue;

there will be an \n in current, if not end of file is reached. After that you continue with the outmost while loop. There the first character on the next line is read into current. Is that not what you wanted?

如果没有到达文件末尾,则会有当前的\ n。之后,继续进行最外面的循环。在那里,下一行的第一个字符被读入当前字符。这不是你想要的吗?

I tried to reproduce your problem (using char and cin instead of wchar_t and wifstream):

我试图重现你的问题(使用char和cin而不是wchar_t和wifstream):

//: get.cpp : compile, then run: get 

int main()
{
  char c;

  while (std::cin.get(c))
  {
    if (c == '/') 
    { 
      char last = c; 
      if (std::cin.get(c) && c == '/')
      {
        // std::cout <<"Read to EOL\n";
        while(std::cin.get(c) && c != '\n'); // this comment will be skipped
        // std::cout <<"go to next line\n";
        std::cin.putback(c);
        continue;
      }
     else { std::cin.putback(c); c = last; }
    }
    std::cout <

 
This program, applied to itself, eliminates all C++ line comments in its output. The inner while loop doesn't eat up all text to the end of file. Please note the putback(c) statement. Without that the newline would not appear.  
该程序适用于自身,它在输出中消除了所有C ++行注释。内部while循环不会占用文件末尾的所有文本。请注意回放(c)声明。没有它,换行就不会出现。 
If it doesn't work the same for wifstream, it would be very strange except for one reason: when the opened text file is not saved as 16bit char and the \n char ends up in the wrong byte... 
如果它对wifstream不起作用,那将是非常奇怪的,除了一个原因:当打开的文本文件没有保存为16位字符并且\ n字符以错误的字节结束时...

                        
                           
							  
							    #3
							    
							    
							      
4  
Wrap the stream (or its buffer, specifically) in a std::streambuf_iterator? That should ignore all formatting, and also give you a nice iterator interface. 
将流(或其缓冲区,特别是)包装在std :: streambuf_iterator中?这应该忽略所有格式,并为您提供一个很好的迭代器接口。 
Alternatively, a much more efficient, and fool-proof, approach might to just use the Win32 API (or Boost) to memory-map the file. Then you can traverse it using plain pointers, and you're guaranteed that nothing will be skipped or converted by the runtime. 
或者,一种更有效,更傻瓜的方法可能只是使用Win32 API(或Boost)来存储映射文件。然后你可以使用普通指针遍历它,并且保证运行时不会跳过或转换任何内容。
							     
							                          
                           
							  
							    #4
							    
							    
							      
2  
The stream extractors behave the same and skip whitespace.  
流提取器的行为相同并跳过空格。 
If you want to read every byte, you can use the unformatted input functions, like stream.get(c). 
如果要读取每个字节,可以使用未格式化的输入函数,如stream.get(c)。
							     
							                          
                           
							  
							    #5
							    
							    
							      
2  
Why not simply use getline ? 
为什么不简单地使用getline? 
You will get all the whitespaces, and while you won't get the end of lines characters, you will still know where they lie :) 
你会得到所有的空格,虽然你不会得到行字符的结尾,你仍然会知道它们在哪里:)
							     
							                          
                           
							  
							    #6
							    
							    
							      
2  
You could open the stream in binary mode: 
您可以以二进制模式打开流: 
std::wifstream stream(filename, std::ios::binary);
 
You'll lose any formatting operations provided my the stream if you do this. 
如果您执行此操作,您将丢失我提供的任何格式化操作。 
The other option is to read the entire stream into a string and then process the string: 
另一个选项是将整个流读取为字符串,然后处理字符串: 
std::wostringstream ss;
ss <
 
OF course, getting the string from the ostringstream rquires an additional copy of the string, so you could consider changing this at some point to use a custom stream if you feel adventurous. EDIT: someone else mention istreambuf_iterator, which is probably a better way of doing it than reading the whole stream into a string. 
当然,从ostringstream获取字符串需要额外的字符串副本,因此如果您有冒险精神,可以考虑在某些时候更改此字符串以使用自定义流。编辑:其他人提到istreambuf_iterator,这可能是比将整个流读入字符串更好的方法。
							     

							  
                        
                           
							  
							    #7
							    
							    
							      
0  
You could just Wrap the stream in a std::streambuf_iterator to get data with all whitespaces and newlines like this . 
您可以将流包装在std :: streambuf_iterator中以获取包含所有空格和新行的数据。 
           /*Open the stream in default mode.*/
            std::ifstream myfile("myfile.txt");

            if(myfile.good()) {
                /*Read data using streambuffer iterators.*/
    vector buf((std::istreambuf_iterator(myfile)), (std::istreambuf_iterator()));

                /*str_buf holds all the data including whitespaces and newline .*/
                string str_buf(buf.begin(),buf.end());

                myfile.close();
            } 

							     
							                          
                           
							  
							    #8
							    
							    
							      
-3  
I ended up just cracking open the Windows API and using it to read the whole file into a buffer first, and then reading that buffer character by character. Thanks guys. 
我最后只是打开Windows API并使用它首先将整个文件读入缓冲区,然后逐个字符地读取缓冲区。多谢你们。




    
        
                        c++
                        stream
                        char
                        go
                        string
                        io
                        get
                        ip
                        const
                    
    



    
        写下你的评论吧 !
        
            
                吐个槽吧,看都看了
            
            
                
                                        会员登录 | 用户注册
                                    
                
            
        

        
    

    
        推荐阅读
        
            
                                
                    
                        input
                        从 .NET 转 Java 的自学之路：IO 流基础篇
                    

                    
                                                
                            
                        
                                                
                        本文详细介绍了 Java 中的 IO 流，包括字节流和字符流的基本概念及其操作方式。探讨了如何处理不同类型的文件数据，并结合编码机制确保字符数据的正确读写。同时，文中还涵盖了装饰设计模式的应用，以及多种常见的 IO 操作实例。 ...
                        [详细]
                    
                    

                    
                        蜡笔小新   2024-12-26 17:37:25
                    

                

                
                                
                    
                        input
                        编写有趣的VBScript恶作剧脚本
                    

                    
                                                
                        本文将介绍如何编写一些有趣的VBScript脚本，这些脚本可以在朋友之间进行无害的恶作剧。通过简单的代码示例，帮助您了解VBScript的基本语法和功能。 ...
                        [详细]
                    
                    

                    
                        蜡笔小新   2024-12-28 09:46:23
                    

                

                                
                    
                    
                
                
                                
                    
                        ip
                        计算机图形学实训：OpenGL入门与直线光栅化算法
                    

                    
                                                
                        本教程涵盖OpenGL基础操作及直线光栅化技术，包括点的绘制、简单图形绘制、直线绘制以及DDA和中点画线算法。通过逐步实践，帮助读者掌握OpenGL的基本使用方法。 ...
                        [详细]
                    
                    

                    
                        蜡笔小新   2024-12-26 12:24:25
                    

                

                
                                
                    
                        input
                        HTTP请求与响应机制详解
                    

                    
                                                
                        本文深入探讨了HTTP请求和响应对象的使用，详细介绍了如何通过响应对象向客户端发送数据、处理中文乱码问题以及常见的HTTP状态码。此外，还涵盖了文件下载、请求重定向、请求转发等高级功能。 ...
                        [详细]
                    
                    

                    
                        蜡笔小新   2024-12-23 20:40:08
                    

                

                
                                
                    
                        input
                        深入理解KMP算法中的next数组：北大OJ 2406题解
                    

                    
                                                
                        本文详细探讨了KMP算法中next数组的构建及其应用，重点分析了未改良和改良后的next数组在字符串匹配中的作用。通过具体实例和代码实现，帮助读者更好地理解KMP算法的核心原理。 ...
                        [详细]
                    
                    

                    
                        蜡笔小新   2024-12-28 11:30:01
                    

                

                
                                
                    
                        config
                        Python配置文件读写指南
                    

                    
                                                
                        本文详细介绍如何使用Python进行配置文件的读写操作，涵盖常见的配置文件格式（如INI、JSON、TOML和YAML），并提供具体的代码示例。 ...
                        [详细]
                    
                    

                    
                        蜡笔小新   2024-12-28 08:39:55
                    

                

                
                                
                    
                        object
                        Java面试题解析
                    

                    
                                                
                        本文详细介绍了Java编程语言中的核心概念和常见面试问题，包括集合类、数据结构、线程处理、Java虚拟机（JVM）、HTTP协议以及Git操作等方面的内容。通过深入分析每个主题，帮助读者更好地理解Java的关键特性和最佳实践。 ...
                        [详细]
                    
                    

                    
                        蜡笔小新   2024-12-27 13:55:14
                    

                

                
                                
                    
                        ip
                        UNP 第9章：主机名与地址转换
                    

                    
                                                
                            
                        
                                                
                        本章探讨了用于在主机名和数值地址之间进行转换的函数，如gethostbyname和gethostbyaddr。此外，还介绍了getservbyname和getservbyport函数，用于在服务器名和端口号之间进行转换。 ...
                        [详细]
                    
                    

                    
                        蜡笔小新   2024-12-27 11:26:39
                    

                

                
                                
                    
                        window
                        Objective-C 编程中的关键语法点
                    

                    
                                                
                        本文探讨了 Objective-C 中的一些重要语法特性，包括 goto 语句、块（block）的使用、访问修饰符以及属性管理等。通过实例代码和详细解释，帮助开发者更好地理解和应用这些特性。 ...
                        [详细]
                    
                    

                    
                        蜡笔小新   2024-12-26 19:42:38
                    

                

                
                                
                    
                        const
                        Weight the Tree（树形dp）
                    

                    
                                                
                        题目Link题目学习link1题目学习link2题目学习link3%%%受益匪浅！－－－－－&# ...
                        [详细]
                    
                    

                    
                        蜡笔小新   2024-12-26 15:55:56
                    

                

                
                                
                    
                        php
                        CUGB图论专题：排水系统中的最大流问题 - EK与Dinic算法解析
                    

                    
                                                
                        本题探讨如何通过最大流算法解决农场排水系统的设计问题。题目要求计算从水源点到汇合点的最大水流速率，使用经典的EK（Edmonds-Karp）和Dinic算法进行求解。 ...
                        [详细]
                    
                    

                    
                        蜡笔小新   2024-12-25 17:47:23
                    

                

                
                                
                    
                        input
                        Java基础：深入理解IO流
                    

                    
                                                
                        本文详细介绍了Java中的输入输出（IO）流，包括其基本概念、分类及应用。IO流是用于在程序和外部资源之间传输数据的一套API。根据数据流动的方向，可以分为输入流（从外部流向程序）和输出流（从程序流向外部）。此外，还涵盖了字节流和字符流的区别及其具体实现。 ...
                        [详细]
                    
                    

                    
                        蜡笔小新   2024-12-25 00:37:31
                    

                

                
                                
                    
                        input
                        读取配置文件中的属性值
                    

                    
                                                
                        本文介绍了一种从与src同级的config目录中读取属性文件内容的方法。通过使用Java的Properties类和InputStream，可以轻松加载并获取指定键对应的值。 ...
                        [详细]
                    
                    

                    
                        蜡笔小新   2024-12-24 14:48:35
                    

                

                
                                
                    
                        input
                        Servlet 表单处理：GET 和 POST 请求的深入解析
                    

                    
                                                
                        本文详细探讨了HTML表单中GET和POST请求的区别，包括它们的工作原理、数据传输方式、安全性及适用场景。同时，通过实例展示了如何在Servlet中处理这两种请求。 ...
                        [详细]
                    
                    

                    
                        蜡笔小新   2024-12-23 18:09:59
                    

                

                
                                
                    
                        hook
                        解决Uploadify在IE浏览器中的兼容性问题
                    

                    
                                                
                        本文详细介绍了如何解决Uploadify插件在Internet Explorer（IE）9和10版本中遇到的点击失效及JQuery运行时错误问题。通过修改相关JavaScript代码，确保上传功能在不同浏览器环境中的一致性和稳定性。 ...
                        [详细]
                    
                    

                    
                        蜡笔小新   2024-12-27 22:07:40

















    

    
        
            
            
                
                
            

            
                执子之手2502891083            

            
                这个家伙很懒，什么也没留下！            


        
    

    
    

    
    

    
        Tags | 热门标签
        
            
                                
                    tags
                
                                
                    python2
                
                                
                    object
                
                                
                    js
                
                                
                    emoji
                
                                
                    vba
                
                                
                    php5
                
                                
                    uml
                
                                
                    cPlusPlus
                
                                
                    actionscrip
                
                                
                    const
                
                                
                    eval
                
                                
                    window
                
                                
                    command
                
                                
                    dockerfile
                
                                
                    schema
                
                                
                    iostream
                
                                
                    netty
                
                                
                    python3
                
                                
                    post
                
                                
                    ip
                
                                
                    blob
                
                                
                    config
                
                                
                    solr
                
                                
                    cSharp
                
                                
                    input
                
                                
                    hashcode
                
                                
                    hook
                
                                
                    php
                
                                
                    golang
                
                                
            
        
    

    
    
        
            
            
        
        RankList | 热门文章
        
            
                                
                    1jQuery删除元素remove()、detach()和empty()的对比分析
                
                                
                    2「智能化改造项目」配电室智能环境监测系统
                
                                
                    3springboot读取yml配置中的一个值
                
                                
                    4开发笔记:[从 0 开始的 Angular 生活]No.38 实现一个 Angular Router 切换组件页面
                
                                
                    5上传禅道包到html目录,cenos6.5下安装 禅道项目软件专业版
                
                                
                    6第一记： 搭建环境
                
                                
                    7深入研究Java内存模型(JMM)的进阶专题
                
                                
                    8二叉树（按层建立二叉树，前中后序以及按层遍历）
                
                                
                    9VC ico替换exe图标 不成功
                
                                
                    10马儿|攻击者_UEditor1.4.3.3的webshell漏洞攻击揭秘
                
                                
                    11Java 中的即时调整()方法，示例
                
                                
                    12剑指Offer面试题：22.二叉搜索树的后序遍历序列
                
                                
                    13用C语言编程最短路径,C语言实现图的最短路径Floyd算法
                
                                
                    14想问一下，iPhone的原彩显示需要开吗？
                
                                
                    15美团外卖手机上如何设置自动接单，苹果手机？:美团设置实付